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Abstract 

Data and procedures and the values they amass, 
Higher-order functions to combine and mix and match, 
Objects with their local state, the messages they pass, 
A property, a package, the control point for a catch — 
In the Lambda Order they are all first-class. 
One Thing to name them all, One Thing to define them, 
One Thing to place them in environments and bind them, 
In the Lambda Order they are all first-class. 



Keywords: Scheme, Lisp, functional programming, computer languages. 



This report describes research done at the Artificial Intelligence Laboratory of the Mas- 
sachusetts Institute of Technology and the Computer Science Department of Indiana 
University. Support for the MIT research is provided in part by the Advanced Research 
Projects Agency of the Department of Defense under Office of Naval Research contracts 
N00014-80-C-0505 and N00014-85-K-0124. Support for the Indiana University research 
is provided by NSP grants NCS 83-04567 and NCS 83-03325. This report is published 
as Computer Science Department Technical Report 174, October 1986. 



With apologies to J. R. R. Tolkien. 



1. Overview of Scheme 



DESCRIPTION OF THE LANGUAGE 



1. Overview of Scheme 

1.1. Semantics 

This section gives an overview of Scheme's semantics. A 
detailed informal semantics is the subject of chapters 3 
through 6. For reference purposes, section 7.2 provides a 
formal semantics of Scheme. 

Following Algol, Scheme is a statically scoped program- 
ming language. Each use of a variable is associated with a 
lexically apparent binding of that variable. 

Scheme has latent as opposed to manifest types. Types 
are associated with values (also called objects) rather than 
with variables. (Some authors refer to languages with 
latent types as weakly typed or dynamically typed lan- 
guages.) Other languages with latent types are APL, 
Snobol, and other dialects of Lisp. Languages with mani- 
fest types (sometimes referred to as strongly typed or stat- 
ically typed languages) include Algol 60, Pascal, and C. 

All objects created in the course of a Scheme computation, 
including procedures and continuations, have unlimited ex- 
tent. No Scheme object is ever destroyed. The reason that 
implementations of Scheme do not (usually!) run out of 
storage is that they are permitted to reclaim the storage 
occupied by an object if they can prove that the object 
cannot possibly matter to any future computation. Other 
languages in which most objects have unlimited extent in- 
clude APL and other Lisp dialects. 

Implementations of Scheme are required to be properly 
tail-recursive. This allows the execution of an iterative 
computation in constant space, even if the iterative compu- 
tation is described by a syntactically recursive procedure. 
Thus with a tail-recursive implementation, iteration can be 
expressed using the ordinary procedure-call mechanics, so 
that special iteration constructs are useful only as syntactic 
sugar. 

Scheme procedures are objects in their own right. Pro- 
cedures can be created dynamically, stored in data struc- 
tures, returned as results of procedures, and so on. Other 
languages with these properties include Common Lisp and 
ML. 

One distinguishing feature of Scheme is that continuations, 
which in most other languages only operate behind the 
scenes, also have "first-class* status. Continuations are 
useful for implementing a wide variety of advanced control 
constructs, including non-local exits, backtracking, and 
coroutines. See section 6.9. 

Arguments to Scheme procedures are always passed by 
value, which means that the actual argument expressions 
are evaluated before the procedure gains control, whether 
the procedure needs the result of the evaluation or not. 



ML, C, and APL are three other languages that always 
pass arguments by value. This is distinct from the lazy- 
evaluation semantics of SASL, or the call-by-name seman- 
tics of Algol 60, where an argument expression is not eval- 
uated unless its value is needed by the procedure. 

1.2. Syntax 

Scheme employs a parenthesized-list Polish notation to de- 
scribe programs and (other) data. The syntax of Scheme, 
like that of most Lisp dialects, provides for great expressive 
power, largely due to its simplicity. An important conse- 
quence of this simplicity is the susceptibility of Scheme 
programs and data to uniform treatment by other Scheme 
programs. As with other Lisp dialects, the read primitive 
parses its input; that is, it performs syntactic as well as 
lexical decomposition of what it reads. 

1.3. Notation and terminology 

1.3.1. Essential and non-essential features 

It is required that every implementation of Scheme support 
features that are marked as being essential. Features not 
explicitly marked as essential are not essential. Implemen- 
tations are free to omit non-essential features of Scheme 
or to add extensions, provided the extensions are not in 
conflict with the language reported here. 

1.3.2. Error situations and unspecified behavior 

When speaking of an error situation, this report uses the 
phrase "an error is signalled" to indicate that implemen- 
tations must detect and report the error. If such wording 
does not appear in the discussion of an error, then imple- 
mentations are not required to detect or report the error, 
though they are encouraged to do so. An error situation 
that implementations are not required to detect is usually 
referred to simply as "an error." 

For example, it is an error for a procedure to be passed an 
argument that the procedure is not explicitly specified to 
handle, even though such domain errors are seldom men- 
tioned in this report. Implementations may extend a pro- 
cedure's domain of definition to include other arguments. 

If the value of an expression is said to be "unspecified," 
then the expression must evaluate to some object without 
signalling an error, but the value depends on the imple- 
mentation; this report explicitly does not say what value 
should be returned. 

1.3.3. Entry format 

Chapters 4 and 6 are organized into entries. Each entry de- 
scribes one language feature or a group of related features, 
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where a feature is either a syntactic construct or a built-in 
procedure. An entry begins with one or more header lines 
of the form 

template essential category 

if the feature is an essential feature, or simply 

template category 

if the feature is not an essential feature. 

If category is "syntax", the entry describes an expression 
type, and the header line gives the syntax of the expression 
type. Components of expressions are designated by syn- 
tactic variables, which are written using angle brackets, 
for example, (expression), (variable). Syntactic variables 
should be understood to denote segments of program text; 
for example, (expression) stands for any string of charac- 
ters which is a syntactically valid expression. The notation 

(thingi) . . . 
indicates zero or more occurrences of a (thing), and 

(thingi) (*hing 2 ) . . . 

indicates one or more occurrences of a (thing). 

If category is "procedure" , then the entry describes a pro- 
cedure, and the header line gives a template for a call to the 
procedure. Argument names in the template are italicized. 
Thus the header line 



(vector-ref vector k) 



essential procedure 



indicates that the essential built-in procedure vector-ref 
takes two arguments, a vector vector and an exact non- 
negative integer k (see below). The header lines 



(append listi 
(append list . 



lispz) 



essential procedure 
procedure 



indicate that in all implementations, the append procedure 
must be defined to take two arguments, and some imple- 
mentations will extend it to take zero or more arguments. 

It is an error for an operation to be presented with an ar- 
gument that it is not specified to handle. For succinctness, 
we follow the convention that if an argument name is also 
the name of a type, then this implies a restriction on the 
type of that argument to the procedure. For example, the 
header line for vector-ref given above dictates that first 
argument to vector-ref must be a vector. The following 
naming conventions also imply type restrictions: 

obj any object 

z, Z\, . . . zy, . . . complex, real, rational, integer 

x,x\, ... Xj, ... real, rational, integer 

y> !/i> •••%»•• • rea l> rational, integer 

q, q\, . . . q 3 -, . . . rational, integer 

n, ni, . . . ny, . . . integer 

k, ki, . . . kj, . . . exact non-negative integer 



1.3.4. Evaluation examples 

The symbol "=V used in program examples should be 
read "evaluates to." For example, 

(* 6 8) =► 40 

means that the expression (* 5 8) evaluates to the ob- 
ject 40. Or, more precisely: the expression given by the 
sequence of characters "(* 5 8)" evaluates, in the initial 
environment, to an object that may be represented exter- 
nally by the sequence of characters "40". See section 3.3 
for a discussion of external representations of objects. 

2. Lexical conventions 

This section gives an informal account of some of the lexical 
conventions used in writing Scheme programs. For a formal 
syntax of Scheme, see section 7.1. 

Upper and lower case forms of a letter are never distin- 
guished except within character and string constants. For 
example, Foo is the same identifier as F00, and #xlAB is 
the same number as #Xlab. 

2.1. Identifiers 

Most identifiers allowed by other programming languages 
are also acceptable to Scheme. The precise rules for form- 
ing identifiers vary among implementations of Scheme, but 
in all implementations a sequence of letters, digits, and "ex- 
tended alphabetic characters" that begins with a character 
that cannot begin a number is an identifier. In addition, + 
and - (which can begin numbers) are identifiers. Here are 
some examples of identifiers: 



lambda 
list->vector 

+ 



q 

soup 

V17a 

a34kTMNs 



the-word-recursion-has -many -meanings 

Extended alphabetic characters may be used in identifiers 
exactly as if they were letters. The following are extended 
alphabetic characters: 

*/<»>!?:$%_fc"~ 

See section 7.1.1 for a formal syntax of identifiers. 

Identifiers have several uses within Scheme programs: 

• Certain identifiers are reserved for use as syntactic 
keywords (see below). 

• Any identifier that is not a syntactic keyword may be 
used as a variable (see section 3.1). 

• When an identifier appears as a literal or within a 
literal (see section 4.1.2), it is being used to denote a 
symbol (see section 6.4). 
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=> 


do 


and 


else 


begin 


if 


case 


lambda 


cond 


let 


define 


let* 


delay 


letrec 



The following identifiers are syntactic keywords, and 
should not be used as variables: 



or 
quasiquote 

quote 

set! 

unquote 

unquote-splicing 



Some implementations allow all identifiers, including syn- 
tactic keywords, to be used as variables. This is a com- 
patible extension to the language, but ambiguities in the 
language result when the restriction is relaxed, and the 
ways in which these ambiguities are resolved vary between 
implementations. 

The characters ? and ! have no special properties — they 
are extended alphabetic characters. By convention, how- 
ever, most predicate procedures (those that return boolean 
values) are named by identifiers that end in ?, and most 
data mutation procedures are named by identifiers that 
end in ! . 

2.2. Whitespace and comments 

Whitespace characters are spaces and newlines. (Imple- 
mentations typically provide additional whitespace char- 
acters such as tab or page break.) Whitespace is used for 
improved readability and as necessary to separate tokens 
from each other, a token being an indivisible lexical unit 
such as an identifier or number, but is otherwise insignifi- 
cant. Whitespace may occur between any two tokens, but 
not within a token. Whitespace may also occur inside a 
string, where it is significant. 

A semicolon (;) indicates the start of a comment. The 
comment continues to the end of the line on which the 
semicolon appears. Comments are invisible to Scheme, but 
the end of the line is visible as whitespace. This prevents a 
comment from appearing in the middle of an identifier or 
number. 

; ; ; The FACT procedure computes the factorial 
;;; of a non-negative integer, 
(define fact 
(lambda (n) 
(if (- n 0) 

1 ;Base case: return 1 

(* n (fact (- n 1)))))) 

2.3. Other notations 

For a description of the notations used for numbers, see 
section 6.5. 

. + - These are used in numbers, and may also occur any- 
where in an identifier except as the first character. A 



delimited plus or minus sign by itself is also an identi- 
fier. A delimited period (not occurring within a num- 
ber or identifier) is used in the notation for pairs (sec- 
tion 6.3), and to indicate a rest-parameter in a formal 
parameter list (section 4.1.4). 

( ) Parentheses are used for grouping and to notate lists 
(section 6.3). 

' The single quote character is used to indicate literal data 
(section 4.1.2). 

* The backquote character is used to indicate almost- 

constant data (section 4.2.6). 

, , 6 The character comma and the sequence comma at- 
sign are used in conjunction with backquote (sec- 
tion 4.2.6). 

" The double quote character is used to delimit strings 
(section 6.7). 

\ Backslash is used in the syntax for character constants 
and as an escape character within string constants 
(section 6.7). 

[ ] { } Left and right square brackets and curly braces 
are reserved for possible future extensions to the lan- 
guage. 

# Sharp sign is used for a variety of purposes depending 

on the character that immediately follows it: 

#t #f These are the boolean constants (section 6.1). 

#\ This introduces a character constant (section 6.6). 

#( This introduces a vector constant (section 6.8). Vector 
constants are terminated by ) . 

#e #i #1 #b #b #o #d #x These are used in the nota- 
tion for numbers (section 6.5.3). 



3. Basic concepts 

3.1. Variables and regions 

Any identifier that is not a syntactic keyword (see sec- 
tion 2.1) may be used as a variable. A variable may name 
a location where a value can be stored. A variable that 
does so is said to be bound to the location. The set of all 
such bindings in effect at some point in a program is known 
as the environment in effect at that point. The value stored 
in the location to which a variable is bound is called the 
variable's value. By abuse of terminology, the variable is 
sometimes said to name the value or to be bound to the 
value. This is not quite accurate, but confusion rarely re- 
sults from this practice. 
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Certain expression types are used to create new locations 
and to bind variables to those locations. The most funda- 
mental of these binding constructs is the lambda expres- 
sion, because all other binding constructs can be explained 
in terms of lambda expressions. The other binding con- 
structs are let, let*, letrec, and do expressions (see sec- 
tions 4.1.4, 4.2.2, and 4.2.4). 

Like Algol and Pascal, and unlike most other dialects of 
Lisp except for Common Lisp, Scheme is a statically scoped 
language with block structure. To each place where a vari- 
able is bound in a program there corresponds a region of the 
program text within which the binding is effective. The re- 
gion is determined by the particular binding construct that 
establishes the binding; if the binding is established by a 
lambda expression, for example, then its region is the en- 
tire lambda expression. Every reference to or assignment 
of a variable refers to the binding of the variable that es- 
tablished the innermost of the regions containing the use. 
If there is no binding of the variable whose region contains 
the use, then the use refers to the binding for the variable 
in the top level environment, if any (section 6); if there is 
no binding for the identifier, it is said to be unbound. 



External representations can also be used for input and 
output. The procedure read (section 6.10.2) parses ex- 
ternal representations, and the procedure write (sec- 
tion 6.10.3) generates them. Together, they provide an 
elegant and powerful input/output facility. 

Note that the sequence of characters "(+ 2 6)" is not an 
external representation of the integer 8, even though it is an 
expression evaluating to the integer 8; rather, it is an exter- 
nal representation of a three-element list, the elements of 
which are the symbol + and the integers 2 and 6. Scheme's 
syntax has the property that any sequence of characters 
which is an expression is also the external representation 
of some object. This can lead to confusion, since it may 
not be obvious out of context whether a given sequence of 
characters is intended to denote data or program, but it is 
also a source of power, since it facilitates writing programs 
such as interpreters and compilers which treat programs as 
data (or vice versa). 

The syntax of external representations of various kinds of 
objects accompanies the description of the primitives for 
manipulating the objects in the appropriate sections of 
chapter 6. 



3.2. True and false 

Any Scheme value can be used as a boolean value for the 
purpose of a conditional test. As explained in section 6.1, 
all values count as true in such a test except for #f and 
the empty list, which count as false. This report uses the 
word "true" to refer to any Scheme value that counts as 
true, and the word "false" to refer to any Scheme value 
that counts as false. 



3.3. External representations 

An important concept in Scheme (and Lisp) is that of the 
external representation of an object as a sequence of charac- 
ters. For example, an external representation of the integer 
28 is the sequence of characters "28" , and an external rep- 
resentation of a list consisting of the integers 8 and 13 is 
the sequence of characters "(8 13)*. 

The external representation of an object is not necessarily 
unique. The integer 28 also has representations "28.000" 
and "#xlc", and the list in the previous paragraph also has 
the representations "( 08 13 )" and "(8 . (13 . ()))" 
(see section 6.3). 

Many objects have standard external representations, but 
some, such as procedures, do not have standard represen- 
tations (although particular implementations may define 
representations for them). 

An external representation may be written in a program to 
obtain the corresponding object (see quote, section 4.1.2). 



4. Expressions 

A Scheme expression is a construct that returns a value, 
such as a variable reference, literal, procedure call, or con- 
ditional. 

Expression types are categorized as primitive or derived. 
Primitive expression types include variables and procedure 
calls. Derived expression types are not semantically primi- 
tive, but can instead be explained in terms of the primitive 
constructs as in section 7.3. They are redundant in the 
strict sense of the word, but they capture common pat- 
terns of usage, and are therefore provided as convenient 
abbreviations. 



4.1. Primitive expression types 
4.1.1. Variable references 



(variable) 



essential syntax 



An expression consisting of a variable (section 3.1) is a 
variable reference. The value of the variable reference is 
the value stored in the location to which the variable is 
bound. It is an error to reference an unbound variable. 



(define x 28) 



28 
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SUMMARY 



The report gives a defining description of the program- 
ming language Scheme. Scheme is a statically scoped and 
properly tail-recursive dialect of the Lisp programming 
language invented by Guy Lewis Steele Jr. and Gerald 
Jay Sussman. It was designed to have an exceptionally 
clear and simple semantics and few different ways to form 
expressions. A wide variety of programming paradigms, in- 
cluding imperative, functional, and message passing styles, 
find convenient expression in Scheme. 

The introduction offers a brief history of the language and 
of the report. 

The first three chapters present the fundamental ideas of 
the language and describe the notational conventions used 
for describing the language and for writing programs in the 
language. 

Chapters 4 and 5 describe the syntax and semantics of 
expressions, programs, and definitions. 

Chapter 6 describes Scheme's built-in procedures, which 
include all of the language's data manipulation and in- 
put/output primitives. 

Chapter 7 provides a formal syntax for Scheme written in 
extended BNF, along with a formal denotational semantics. 

The report concludes with an example of the use of the 
language and an alphabetic index. 
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INTRODUCTION 



Programming languages should be designed not by piling 
feature on top of feature, but by removing the weaknesses 
and restrictions that make additional features appear nec- 
essary. Scheme demonstrates that a very small number of 
rules for forming expressions, with no restrictions on how 
they are composed, suffice to form a practical and efficient 
programming language that is flexible enough to support 
most of the major programming paradigms in use today. 

Scheme has influenced the evolution of Lisp. Scheme was 
one of the first programming languages to incorporate first 
class procedures as in the lambda calculus, thereby proving 
the usefulness of static scope rules and block structure in 
a dynamically typed language. Scheme was the first major 
dialect of Lisp to distinguish procedures from lambda ex- 
pressions and symbols, to use a single lexical environment 
for all variables, and to evaluate the operator position of 
a procedure call in the same way as an operand position. 
By relying entirely on procedure calls to express iteration, 
Scheme emphasized the fact that tail-recursive procedure 
calls are essentially goto's that pass arguments. Scheme 
was the first widely used programming language to em- 
brace first class escape procedures, from which all known 
sequential control structures can be synthesized. A few 
of these innovations have recently been incorporated into 
Common Lisp, while others remain to be adopted. 

Background 

The first description of Scheme was written in 1975 [48]. A 
revised report [44] appeared in 1978, which described the 
evolution of the language as its MIT implementation was 
upgraded to support an innovative compiler [41]. Three 
distinct projects began in 1981 and 1982 to use variants 
of Scheme for courses at MIT, Yale, and Indiana Univer- 
sity [30,23,10]. An introductory computer science textbook 
using Scheme was published in 1984 [l]. 

As might be expected of a language used primarily for ed- 
ucation and research, Scheme has always evolved rapidly. 
This was no problem when Scheme was used only within 
MIT, but as Scheme became more widespread, local di- 
alects began to diverge until students and researchers oc- 
casionally found it difficult to understand code written at 
other sites. 

Fifteen representatives of the major implementations of 
Scheme therefore met in October 1984 to work toward 
a better and more widely accepted standard for Scheme. 
Participating in this workshop were Hal Abelson, Nor- 
man Adams, David Bartley, Gary Brooks, William dinger, 
Daniel Friedman, Robert Halstead, Chris Hanson, Christo- 
pher Haynes, Eugene Kohlbecker, Don Oxley, Jonathan 
Rees, Guillermo Rozas, Gerald Jay Sussman, and Mitchell 
Wand. Kent Pitman made valuable contributions to the 



agenda for the workshop but was unable to attend the ses- 
sions. 

Subsequent electronic mail discussions and committee work 
completed the definition of the language. Gerry Sussman 
drafted the section on numbers, Chris Hanson drafted the 
sections on characters and strings, and Gary Brooks and 
William Clinger drafted the sections on input and output. 
William Clinger recorded the decisions of the workshop and 
compiled the pieces into a coherent document. The "Re- 
vised revised report on Scheme* [4] was published at MIT 
and Indiana University in the summer of 1985. Another 
round of revision in the spring of 1986, again accomplished 
almost entirely by electronic mail, resulted in the present 
report. 

We intend this report to belong to the entire Scheme com- 
munity, and so we grant permission to copy it in whole or in 
part without fee. In particular, we encourage implementors 
of Scheme to use this report as a starting point for manuals 
and other documentation, modifying it as necessary. 
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4.1.2. Literal expressions 

(quote (datum)) essential syntax 

' (datum) essential syntax 

(constant) essential syntax 

(quote (datum)) evaluates to (datum). (Datum) may be 
any external representation of a Scheme object (see sec- 
tion 3.3). This notation is used to include literal constants 
in Scheme code. 



(quote a) 
(quote #(a be)) 
(quote (+ 1 2)) 



a 

#(a b c) 

(+ 1 2) 



(quote (datum)) may be abbreviated as '(datum). The 
two notations are equivalent in all respects. 



'a 

•#(a b c) 
'(+ 1 2) 

* (quote a) 



a 

#(a b c) 
(+ 1 2) 
(quote a) 
(quote a) 



Numeric constants, string constants, character constants, 
and boolean constants evaluate "to themselves" ; they need 
not be quoted. 



"'abc 1 ' 

"abc" 

'145932 

145932 
•#t 

#t 



"abc" 

"abc" 

146932 

146932 

#t 

#t 



It is an error to alter a constant (i.e. the value of a literal 
expression) using a mutation procedure like set-car! or 
string-set!. 

4.1.3. Procedure calls 

((operator) (operandi) •••) essential syntax 

A procedure call is written by simply enclosing in parenthe- 
ses expressions for the procedure to be called and the argu- 
ments to be passed to it. The operator and operand expres- 
sions are evaluated (in an indeterminate order) and the re- 
sulting procedure is passed the resulting arguments. 



(+ 3 4) 

((if #f + *) 3 4) 



7 

12 



A number of procedures are available as the values of vari- 
ables in the initial environment; for example, the addition 
and multiplication procedures in the above examples are 
the values of the variables + and *. New procedures are cre- 
ated by evaluating lambda expressions (see section 4.1.4). 

Procedure calls are also called combinations. 

Note: In contrast to other dialects of Lisp, the order of 
evaluation is unspecified, and the operator expression and the 
operand expressions are always evaluated with the same evalu- 
ation rules. 



Note: In many dialects of Lisp, the empty combination, (), 
is a legitimate expression. In Scheme, combinations must have 
at least one subexpression, so () is not a syntactically valid 
expression. 

4.1.4. Lambda expressions 

(lambda (formals) (body)) essential syntax 

Syntax: (Formals) should be a formal arguments list as 
described below, and (body) should be a sequence of one 
or more expressions. 

Semantics: A lambda expression evaluates to a procedure. 
The environment in effect when the lambda expression was 
evaluated is remembered as part of the procedure. When 
the procedure is later called with some actual arguments, 
the environment in which the lambda expression was evalu- 
ated will be extended by binding the variables in the formal 
argument list to fresh locations, the corresponding actual 
argument values will be stored in those locations, and the 
expressions in the body of the lambda expression will be 
evaluated sequentially in the extended environment. The 
result of the last expression in the body will be returned 
as the result of the procedure call. 

(lambda (x) (+ x x)) ==► a procedure 

((lambda (x) (+ x x)) 4) =>• 8 

(define reverse-subtract 
(lambda (x y) (- y x))) 
(reverse-subtract 7 10) => 3 

(define f oo 
(let ((x 4)) 

(lambda (y) (+ x y)))) 
(foo 6) => 10 

(Formals) should have one of the following forms: 

• ((variablei) ...): The procedure takes a fixed num- 
ber of arguments; when the procedure is called, the 
arguments will be stored in the bindings of the corre- 
sponding variables. 

• (variable): The procedure takes any number of ar- 
guments; when the procedure is called, the sequence 
of actual arguments is converted into a newly allo- 
cated list, and the list is stored in the binding of the 
(variable). 

• ((variablei) ••• (variable„_i) . (variable n )): If a 
space-delimited period precedes the last variable, then 
the value stored in the binding of the last variable will 
be a newly allocated list of the actual arguments left 
over after all the other actual arguments have been 
matched up against the other formal arguments. 

((lambda x x) 3 4 6 6) =► (3 4 6 6) 

((lambda (x y .2) z) 
3 4 5 6) => (6«) 
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4.1.5. Conditionals 

(if (test) (consequent) (alternate)) essential syntax 

(if (test) (consequent)) syntax 

Syntax: (Test), (consequent), and (alternate) maybe arbi- 
trary expressions. 

Semantics: An if expression is evaluated as follows: first, 
(test) is evaluated. If it yields a true value (see section 6.1), 
then (consequent) is evaluated and its value is returned. 
Otherwise (alternate) is evaluated and its value is returned. 
If (test) yields a false value and no (alternate) is specified, 
then the result of the expression is unspecified. 



(if (> 3 2) 


'yea *no) 


=> yes 


(if (> 2 3) 


'yes 'no) 


=> no 


(if (> 3 2) 






(- 3 2) 







(+ 3 2)) => 1 

4.1.6. Assignments 

(set! (variable) (expression)) essential syntax 

(Expression) is evaluated, and the resulting value is stored 
in the location to which (variable) is bound. (Variable) 
must be bound either in some region enclosing the Bet! 
expression or at top level. The result of the set ! expression 
is unspecified. 



(define x 
(+ x 1) 
(set! x 4) 
(+ x 1) 



2) 



unspecified 
6 



4.2. Derived expression types 

For reference purposes, section 7.3 gives rewrite rules that 
will convert constructs described in this section into the 
primitive constructs described in the previous section. 

4.2.1. Conditionals 

(cond (clausea ) (clauses) ...) essential syntax 

Syntax: Each (clause) should be of the form 

((test) (expression) ...) 

where (test) is any expression. The last (clause) may be 
an "else clause," which has the form 

(else (expression i) (expressions) •••)• 

Semantics: A cond expression is evaluated by evaluating 
the (test) expressions of successive (clause)s in order until 
one of them evaluates to a true value (see section 6.1). 
When a (test) evaluates to a true value, then the remaining 
(expression)s in its (clause) are evaluated in order, and the 
result of the last (expression) in the (clause) is returned 
as the result of the entire cond expression. If the selected 



(clause) contains only the (test) and no (expression)s, then 
the value of the (test) is returned as the result. If all (test)s 
evaluate to false values, and there is no else clause, then 
the result of the conditional expression is unspecified; if 
there is an else clause, then its (expression)s are evaluated, 
and the value of the last one is returned. 



(cond ((> 3 2) 'greater) 
((< 3 2) 'less)) 

(cond ((> 3 3) 'greater) 
((< 3 3) 'less) 
(else 'equal)) 



greater 



equal 



Some implementations support an alternative (clause) syn- 
tax, ((test) -> (recipient)), where (recipient) is an expres- 
sion. If (test) evaluates to a true value, then (recipient) is 
evaluated. Its value must be a procedure of one argument; 
this procedure is then invoked on the value of the (test). 

(cond ((assv 'b '((a 1) (b 2))) => cadr) 
(else #f)) ==>• 2 



(case (key) (clausei) (clauses) ...) syntax 

Syntax: (Key) may be any expression. Each (clause) 
should have the form 

(((datumi) ...) {expression:) (expressions) •••), 

where each (datum) is an external representation of some 
object. All the (datum)s must be distinct. The last (clause) 
may be an "else clause," which has the form 

(else (expressioni) (expressions) ...). 

Semantics: A case expression is evaluated as follows. 
(Key) is evaluated and its result is compared against each 
(datum). If the result of evaluating (key) is equivalent 
(in the sense of eqv?; see section 6.2) to a (datum), then 
the expressions in the corresponding (clause) are evaluated 
from left to right and the result of the last expression in 
the (clause) is returned as the result of the case expres- 
sion. If the result of evaluating (key) is different from every 
(datum), then if there is an else clause its expressions are 
evaluated and the result of the last is the result of the case 
expression; otherwise the result of the case expression is 
unspecified. 

(case (* 2 3) 

((2357) 'prime) 

((14689) 'composite)) => composite 
(case (car ' (c d)) 

((a) 'a) 

((b) *b)) =*► unspecified 

(case (car ' (c d)) 

((a e i o u) 'vowel) 

((w y) ' semivowel) 

(else 'consonant)) =*■ consonant 
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(and (testi) ...) 



syntax 



The (test) expressions are evaluated from left to right, and 
the value of the first expression that evaluates to a false 
value (see section 6.1) is returned. Any remaining expres- 
sions are not evaluated. If all the expressions evaluate to 
true values, the value of the last expression is returned. If 
there are no expressions then #t is returned. 



(and (- 2 2) (> 2 1)) 
(and (- 2 2) (< 2 1)) 
(and 1 2 'c '(i g)) 
(and) 



(or (testx) .. .) 



#f 
(f g) 

#t 



syntax 

The (test) expressions are evaluated from left to right, and 
the value of the first expression that evaluates to a true 
value (see section 6.1) is returned. Any remaining expres- 
sions are not evaluated. If all expressions evaluate to false 
values, the value of the last expression is returned. If there 
are no expressions then #f is returned. 



(or (- 2 2) (> 2 1)) 


=> 


#t 


(or (- 2 2) (< 2 1)) 


=> 


#t 


(or #f #f #f) 


=> 


#f 


(or (meaq 'b '(a b c)) 






(/ 3 0)) 


=> 


(b 



4.2.2. Binding constructs 

The three binding constructs let, let*, and letrec give 
Scheme a block structure, like Algol 60. The syntax of the 
three constructs is identical, but they differ in the regions 
they establish for their variable bindings. In a let ex- 
pression, the initial values are computed before any of the 
variables become bound; in a let* expression, the bind- 
ings and evaluations are performed sequentially; while in 
a letrec expression, the bindings are in effect while their 
initial values are being computed, thus allowing mutually 
recursive definitions. 



(let (bindings) (body)) essential syntax 

Syntax: (Bindings) should have the form 

(((variablei) (initi)) ...), 

where each (init) is an expression, and (body) should be a 
sequence of one or more expressions. 

Semantics: The (init)s are evaluated in the current envi- 
ronment (in some unspecified order), the (variable)s are 
bound to fresh locations holding the results, the (body) is 
evaluated in the extended environment, and the value of 
the last expression of (body) is returned. Each binding of 
a (variable) has (body) as its region. 



(let ((x 2) (y 3)) 

(* x y)) =*► 6 

(let ((x 2) (y 3)) 

(let ((foo (lambda (z) (+ x y z))) 
(x 7)) 
(foo 4))) =► 

See also named let, section 4.2.4. 



(let* (bindings) (body)) syntax 

Syntax: (Bindings) should have the form 

(((variablei) (initi)) ...), 

and (body) should be a sequence of one or more expres- 
sions. 

Semantics: Let* is similar to let, but the bindings are 
performed sequentially from left to right, and the region of 
a binding indicated by ((variable) (init)) is that part of 
the let* expression to the right of the binding. Thus the 
second binding is done in an environment in which the first 
binding is visible, and so on. 

(let* ((x 1) (y (+ x 1))) 
y) => 2 



essential syntax 



(letrec (bindings) (body)) 

Syntax: (Bindings) should have the form 

(((variablei) (initi)) ...), 

and (body) should be a sequence of one or more expres- 
sions. 

Semantics: The (variable)s are bound to fresh locations 
holding undefined values, the (init)s are evaluated in the 
resulting environment (in some unspecified order), each 
(variable) is assigned to the result of the corresponding 
(init), the (body) is evaluated in the resulting environment, 
and the value of the last expression in (body) is returned. 
Each binding of a (variable) has the entire letrec expres- 
sion as its region, making it possible to define mutually 
recursive procedures. 

(letrec ((even? 

(lambda (n) 
(if (zero? n) 
#t 

(odd? (- n 1))))) 
(odd? 
(lambda (n) 
(if (zero? n) 
#f 

(even? (- n 1)))))) 
(even? 88)) 

=> #t 
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One restriction on letrec is very important: it must be 
possible to evaluate each (init) without referring to the 
value of any (variable). If this restriction is violated, then 
the effect is undefined, and an error may be signalled dur- 
ing evaluation of the (init}s. The restriction is necessary 
because Scheme passes arguments by value rather than by 
name. In the most common uses of letrec, all the (init)s 
are lambda expressions and the restriction is satisfied au- 
tomatically. 

4.2.3. Sequencing 

(begin (expressioni) (expressions) ...) essential syntax 

The (expression) s are evaluated sequentially from left to 
right, and the value of the last (expression) is returned. 
This expression type is used to sequence side effects such 
as input and output. 



(begin (set! x 6) 
(+ x i)) 



a 



(begin (display "4 plus 1 equals ") 

(display (+ 4 1))) =>• unspecified 
and prints 4 plus 1 equals 6 

Note: [l] uses the keyword sequence instead of begin. 



4.2.4. Iteration 

(do (((variablei) (initi) (stepi)) syntax 

...) 

((test) (expression) ...) 
(command) . . . ) 

Do is an iteration construct. It specifies a set of variables 
to be bound, how they are to be initialized at the start, 
and how they are to be updated on each iteration. When a 
termination condition is met, the loop exits with a specified 
result value. 

Do expressions are evaluated as follows: The (init) ex- 
pressions are evaluated (in some unspecified order), the 
(variable)s are bound to fresh locations, the results of 
the (init) expressions are stored in the bindings of the 
(variable)s, and then the iteration phase begins. 

Each iteration begins by evaluating (test); if the result is 
false (see section 6.1), then the (command) expressions are 
evaluated in order for effect, the (step) expressions are eval- 
uated in some unspecified order, the (variable)s are bound 
to fresh locations, the results of the (step)s are stored in the 
bindings of the (variable)s, and the next iteration begins. 

If (test) evaluates to a true value, then the (expression)s 
are evaluated from left to right and the value of the last 
(expression) is returned as the value of the do expression. 
If no (expression)s are present, then the value of the do 
expression is unspecified. 



The region of the binding of a (variable) consists of the 
entire do expression except for the (init)s. 

A (step) may be omitted, in which case the effect is the 
same as if ((variable) (init) (variable)) had been written 
instead of ((variable) (init)). 



(do ((vec (make-vector 6)) 
(i (+ i 1))) 
((- i 6) vac) 
(vector-set! vec 11)) 



#(0 12 3 4) 



(let ((x "(1367 9))) 
(do ((x x (cdr x)) 

(sum (+ sum (car x)))) 
((null? x) sum))) =*• 2E 



(let (variable) (bindings) (body)) 



syntax 



Some implementations of Scheme permit a variant on the 
syntax of let called "named let" which provides a more 
general looping construct than do, and may also be used 
to express recursions. 

Named let has the same syntax and semantics as ordi- 
nary let except that (variable) is bound within (body) to 
a procedure whose formal arguments are the bound vari- 
ables and whose body is (body). Thus the execution of 
(body) may be repeated by invoking the procedure named 
by (variable). 

(let loop ((numbers '(3-216 -5)) 
(nonneg '()) 
(neg '())) 
(cond ((null? numbers) (list nonneg neg)) 
((>» (car numbers) 0) 
(loop (cdr numbers) 

(cons (car numbers) nonneg) 
neg)) 
((< (car numbers) 0) 
(loop (cdr numbers) 
nonneg 

(cons (car numbers) neg))))) 
<=> ((6 13) (-B -2)) 



4.2.5. Delayed evaluation 

(delay (expression)) syntax 

The delay construct is used together with the procedure 
force to implement lazy evaluation or call by need, (delay 
(expression)) returns an object called a promise which at 
some point in the future may be asked (by the force pro- 
cedure) to evaluate (expression) and deliver the resulting 
value. 

See the description of force (section 6.9) for a complete 
description of delay. 
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4.2.6. Quasiquotation 

(quasiquote (template)) 
* (template) 



syntax 

syntax 



"Backquote" or "quasiquote* expressions are useful for 
constructing a list or vector structure when most but not 
all of the desired structure is known in advance. If no 
commas appear within the (template), the result of evalu- 
ating * (template) is equivalent to the result of evaluating 
'(template). If a comma appears within the (template), 
however, the expression following the comma is evaluated 
("unquoted") and its result is inserted into the structure 
instead of the comma and the expression. If a comma ap- 
pears followed immediately by an at-sign (fi), then the fol- 
lowing expression must evaluate to a list; the opening and 
closing parentheses of the list are then "stripped away" and 
the elements of the list are inserted in place of the comma 
at-sign expression sequence. 

"(list .(+ 1 2) 4) => (list 3 4) 

(let ((name 'a)) * (list .name '.name)) 

=*• (list a (quote a)) 
"(a ,(+ 1 2) ,C(map abs '(4 -5 6)) b) 

==*• (a 3 4 B 6 b) 
*((foo ,(- 10 3)) ,C(cdr '(c)) . .(car '(cons))) 

=> ((loo 7) . cons) 
•#(10 6 .(sqrt 4) ,B(map sqrt "(16 9)) 8) 

=> #(10 5 2 4 3 8) 

Quasiquote forms may be nested. Substitutions are made 
only for unquoted components appearing at the same nest- 
ing level as the outermost backquote. The nesting level in- 
creases by one inside each successive quasiquotation, and 
decreases by one inside each unquotation. 

*(a *(b .(♦ 1 2) ,(foo .(+13) d) e) f) 



(a ' (b , (+ 1 2) . (too 4 d) e) f ) 



(let ((namel *x) 
(name2 *y)) 
* (a *(b . , namel 
(a 



' ,name2 d) e)) 
(b ,x . 'j d) e) 

The notations * (template) and (quasiquote (template)) 
are identical in all respects. , (expression) is identical to 
(unquote (expression)), and ,9 (expression) is identical to 
(unquote-splicing (expression)). The external syntax 
generated by write for two-element lists whose car is one 
of these symbols may vary between implementations. 

(quasiquote (list (unquote (+ 1 2)) 4)) 

=> (list 3 4) 
'(quasiquote (list (unquote (+ 1 2)) 4)) 

=> '(list .(+12) 4) 
i.e., (quasiquote (list (unquote (+ 1 2)) 4)) 

Unpredictable behavior can result if any of the symbols 
quasiquote, unquote, or unquote-splicing appear in 
positions within a (template) otherwise than as described 
above. 



5. Program structure 
5.1. Programs 

A Scheme program consists of a sequence of expressions 
and definitions. Expressions are described in chapter 4; 
definitions are the subject of the rest of the present chapter. 

Programs are typically stored in files or entered inter- 
actively to a running Scheme system, although other 
paradigms are possible; questions of user interface lie out- 
side the scope of this report. (Indeed, Scheme would still 
be useful as a notation for expressing computational meth- 
ods even in the absence of a mechanical implementation.) 

Definitions occurring at the top level of a program are in- 
terpreted declaratively. They cause bindings to be cre- 
ated in the top level environment. Expressions occurring 
at the top level of a program are interpreted imperatively; 
they are executed in order when the program is invoked or 
loaded, and typically perform some kind of initialization. 



5.2. Definitions 

Definitions are valid in some, but not all, contexts where 
expressions are allowed. They are valid only at the top 
level of a (program) and, in some implementations, at the 
beginning of a (body). 

A definition should have one of the following forms: 

• (define (variable) (expression)) 
This syntax is essential. 

• (define ((variable) (formals)) (body)) 

This syntax is not essential. (Formals) should be ei- 
ther a sequence of zero or more variables, or a sequence 
of one or more variables followed by a space-delimited 
period and another variable (as in a lambda expres- 
sion). This form is equivalent to 

(define (variable) 

(lambda ((formals)) (body))). 

• (define ((variable) . (formal)) (body)) 

This syntax is not essential. (Formal) should be a 
single variable. This form is equivalent to 

(define (variable) 

(lambda (formal) (body))). 

5.2.1. Top level definitions 

At the top level of a program, a definition 

(define (variable) (expression)) 

has essentially the same effect as the assignment expres- 
sion 

(set ! (variable) (expression)) 
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if (variable) is bound. If (variable) is not bound, however, 
then the definition will bind (variable) to a new location 
before performing the assignment, whereas it would be an 
error to perform a set ! on an unbound variable. 



(define add3 

(lambda (x) (+ x 3))) 
(add3 3) 

(define first car) 
(first *(1 2)) 



All Scheme implementations must support top level defini- 
tions. 

Some implementations of Scheme use an initial environ- 
ment in which all possible variables are bound to locations, 
most of which contain undefined values. Top level defini- 
tions in such an implementation are truly equivalent to 
assignments. 

5.2.2. Internal definitions 

Some implementations of Scheme permit definitions to oc- 
cur at the beginning of a (body) (that is, the body of a 
lambda, let, let*, letrec, or define expression). Such 
definitions are known as inferno/ definitions as opposed to 
the top level definitions described above. The variable de- 
fined by an internal definition is local to the (body). That 
is, (variable) is bound rather than assigned, and the region 
of the binding is the entire (body). For example, 



(let ((x 5)) 

(define foo (lambda (y) (bar x y))) 
(define bar (lambda (a b) (+ (* a b) 
(foo (+ x 3))) =► 46 



a))) 



A (body) containing internal definitions can always be con- 
verted into a completely equivalent letrec expression. For 
example, the let expression in the above example is equiv- 
alent to 

(let ((x 6)) 

(letrec ((foo (lambda (y) (bar x y))) 

(bar (lambda (a b) (+ (* a b) a)))) 
(foo (+ x 3)))) 



6. Standard procedures 

This chapter describes Scheme's built-in procedures. The 
initial (or "top level") Scheme environment starts out with 
a number of variables bound to locations containing useful 
values, most of which are primitive procedures that manip- 
ulate data. For example, the variable abs is bound to (a 
location initially containing) a procedure of one argument 
that computes the absolute value of a number, and the 
variable + is bound to a procedure that computes sums. 



6.1. Booleans 

The standard boolean objects for true and false are written 
as #t and #f . What really matters, though, are the objects 
that the Scheme conditional expressions (if, cond, and, 
or, do) treat as true or false. The phrase "a true value* 
(or sometimes just "true") means any object treated as 
true by the conditional expressions, and the phrase "a false 
value" (or "false") means any object treated as false by the 
conditional expressions. 

Of all the standard Scheme values, only #f and the empty 
list count as false in conditional expressions. Everything 
else, including #t, pairs, symbols, numbers, strings, vec- 
tors, and procedures, counts as true. 

The empty list counts as false for compatibility with exist- 
ing programs and implementations that assume this to be 
the case. 

Programmers accustomed to other dialects of Lisp should 
beware that Scheme distinguishes false and the empty list 
from the symbol nil. 

Boolean constants evaluate to themselves, so they don't 
need to be quoted in programs. 



#t 
#f 
*#f 



#t 
#f 
#f 



(not obj) essential procedure 

Not returns #t if obj is false, and returns #f otherwise. 



(not #t) 
(not 3) 

(not (list 3)) 
(not #f) 
(not '()) 
(not (list)) 



(boolean? obj) 



#f 
#f 
#f 
#t 
#t 
#t 



essential procedure 

Boolean? returns #t if obj is either #t or #f and returns 
#f otherwise. 



(boolean? #f ) 

(boolean? 0) 



#t 
#f 



nil 
t 



variable 
variable 



Some implementations provide variables nil and t whose 
values in the initial environment are #f and #t respectively. 



t 

nil 

•nil 



#t 
#f 
nil 
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6.2. Equivalence predicates 

A predicate is a procedure that always returns a boolean 
value (#t or #f). An equivalence predicate is the compu- 
tational analogue of a mathematical equivalence relation 
(it is symmetric, reflexive, and transitive). Of the equiva- 
lence predicates described in this section, eq? is the finest 
or most discriminating, and equal? is the coarsest. Eqv? 
is slightly less discriminating than eq?. 

Two objects are operationally equivalent if and only if there 
is no way that they can be distinguished, using Scheme 
primitives other than eqv? or eq? or those like memq and 
assv whose meaning is defined explicitly in terms of eqv? 
or eq?. It is guaranteed that objects maintain their opera- 
tional identity despite being named by variables or fetched 
from or stored into data structures. 

This definition can be interpreted in the following ways for 
various kinds of objects: 

• The two boolean values, #t and #f , are operationally 
distinct because they behave in opposite ways in con- 
ditionals. 

• Two symbols are operationally equivalent if they print 
the same way. 

• Two numbers are operationally equivalent if they are 
numerically equal (see *», section 6.5.4) and are either 
both exact or both inexact (see section 6.5.2). 

• Two characters are operationally equivalent if they are 
the same character according to char=? (section 6.6). 

• Two structured mutable objects — pairs, vectors, or 
strings — are operationally equivalent if they have op- 
erationally equivalent values in corresponding posi- 
tions, and applying a mutation procedure to one 
causes the other to change as well. (A mutation pro- 
cedure is one like set-car! which alters a data struc- 
ture.) For example, two pairs are not operationally 
equivalent if a set-car! operation on one does not 
change the car field of the other. 

• There is only one empty list, and it is operationally 
equivalent to itself. All empty vectors are opera- 
tionally equivalent to each other. All empty strings 
are operationally equivalent to each other. Whether 
there is more than one empty vector or string is 
implementation-dependent. 

• Two procedures are operationally equivalent if, when 
called on operationally equivalent arguments, they re- 
turn the same value and perform the same side effects. 



(eqv? obji 06/2) essential procedure 

The eqv? procedure implements an approximation to the 
relation of operational equivalence. It returns #t if it can 



prove that obj\ and obfa are operationally equivalent. If it 
can't, it always errs on the conservative side and returns 
#f. 

The only situation in which it might fail to prove is when 
obji and obj'2 are operationally equivalent procedures that 
were created at different times. In general, operational 
equivalence of procedures is uncomputable, but it is guar- 
anteed that eqv? can recognize a procedure created at a 
given time by a given lambda expression as "being itself." 
This is useful for applications in which procedures are be- 
ing used to implement objects with local state. 



(eqv? *a 'a) 

(eqv? 'a *b) 

(eqv? 2 2) 

(eqv? '() '()) 

(eqv? "" "") 

(eqv? 100000000 100000000) 

(eqv? (cons 1 2) (cons 12)) 

(eqv? (lambda () 1) 

(lambda () 2)) 
(eqv? #f 'nil) 
(let ((p (lambda (x) x))) 
(eqv? p p)) 



#t 
#f 
#t 
#t 
*t 
#t 
#f 

#f 
#f 

#t 



The following examples illustrate cases in which eqv? is 
permitted to fail to prove operational equivalence, depend- 
ing on the implementation. (In every case, it will return 
either #t or #f , but which one it returns is implementation- 
dependent.) Compare with the last example in the previ- 
ous set. 



(eqv? (lambda (x) x) 

(lambda (x) x)) 

(eqv? (lambda (x) x) 

(lambda (y) y)) 



unspecified 
unspecified 



The next set of examples shows the use of eqv? with pro- 
cedures that have local state. Gen-counter must return a 
distinct procedure every time, since each procedure has its 
own internal counter. Gen-loser, however, returns equiv- 
alent procedures each time, since the local state does not 
affect the value or side effects of the procedures. 

(define gen-counter 
(lambda () 
(let ((n 0)) 

(lambda () (set! n (+ n 1)) n)))) 
(let ((g (gen-counter))) 

(eqv? g g)) => #t 

(eqv? (gen-counter) (gen-counter)) 

=>• #f 
(define gen-loser 
(lambda () 
(let ((n 0)) 

(lambda () (set! n (+ n 1)) 27)))) 
(let ((g (gen-loser))) 

(eqv? g g)) ==>• #t 

(eqv? (gen-loser) (gen-loser)) 

=> unspecified 
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Objects of distinct types axe never operationally equivalent, 
except that false and the empty list are permitted to be 
identical, and the character type need not be disjoint from 
other types. 



(eqv? '() #f) 
(eqv? 67 #\A) 



unspecified 
unspecified 



Since it is an error to modify constant objects (those re- 
turned by literal expressions), implementations are per- 
mitted, though not required, to share structure between 
constants where appropriate. Thus the value of eqv? on 
constants is sometimes implementation-dependent. 



(let ((x *(a)» 

(eqv? x x)) 
(eqv? '(a) '(a)) 
(eqv? "a" "a") 
(eqv? * (b) (cdr • (a b) ) ) 



==> #t 

ass> unspecified 

=> unspecified 

=>■ unspecified 



Note: The above definition of eqv? allows implementations 
latitude in their treatment of procedures and literals: imple- 
mentations are free to either detect or fail to detect that two 
procedures or two literals are operationally equivalent to each 
other, and can decide whether or not to merge representations 
of equivalent objects by using the same pointer or bit pattern 
to represent both. 



(eq? obji ob}2) 



essential procedure 



Eq? is similar to eqv? except that in some cases it is capable 
of discerning distinctions finer than those detectable by 
eqv?. 

Eq? and eqv? are guaranteed to have the same behavior 
on symbols, booleans, the empty list, pairs, and non-empty 
strings and vectors. Eq?'s behavior on numbers and charac- 
ters is implementation-dependent, but it will always return 
either true or false, and will return true only when eqv? 
would also return true. Eq? may also behave differently 
from eqv? on empty vectors and empty strings. 



(eq? 'a *a) 




=> 


#t 


(eq? '(a) '(a)) 




=> 


unspecified 


(eq? (list 'a) (list 


•a)) 


==> 


#f 


(eq? "a" "a") 




=^> 


unspecified 


(eq? "" "") 




=» 


unspecified 


(eq? '() •()) 




=*• 


#t 


(eq? 2 2) 




=*■ 


unspecified 


(eq? #\A #\A) 




=» 


unspecified 


(eq? car car) 




=> 


#t 


(let ((n (+ 2 3))) 








(eq? n n)) 




=> 


unspecified 


(let ((x '(a))) 








(eq? x x)) 




=> 


#t 


(let ((x '#())) 








(eq? x x)) 




=> 


#t 


(let ((p (lambda (x) x))) 






(eq? p p)) 




=» 


#t 



Note: It will usually be possible to implement eq? much more 
efficiently than eqv?, for example, as a simple pointer compari- 
son instead of as some more complicated operation. One reason 
is that it may not be possible to compute eqv? of two numbers 
in constant time, whereas eq? implemented as pointer compar- 
ison will always finish in constant time. Eq? may be used like 
eqv? in applications using procedures to implement objects with 
state since it obeys the same constraints as eqv?. 

(equal? obj\ obj'2) essential procedure 

Equal? recursively compares the contents of pairs, vectors, 
and strings, applying eqv? on other objects such as num- 
bers and symbols. A rule of thumb is that objects are 
generally equal? if they print the same. Equal? may fail 
to terminate if its arguments are circular data structures. 



(equal? 'a 'a) 




=> 


#t 


(equal? * (a) "(a)) 




=> 


#t 


(equal? '(a (b) c) 








'(a (b) c)) 




=> 


#t 


(equal? "abc" "abc") 




=> 


#t 


(equal? 2 2) 




=>■ 


#t 


(equal? (make-vector 6 


'a) 






(make -vector 6 


•a)) 


=>• 


#t 


(equal? (lambda (x) x) 








(lambda (y) y)) 




==> 


unspecified 



6.3. Pairs and lists 

A pair (sometimes called a dotted pair) is a record structure 
with two fields called the car and cdr fields (for historical 
reasons). Pairs are created by the procedure cons. The 
car and cdr fields are accessed by the procedures car and 
cdr. The car and cdr fields are assigned by the procedures 
set-car! and set-cdr!. 

Pairs are used primarily to represent lists. A list can be 
defined recursively as either the empty list or a pair whose 
cdr is a list. The objects in the car fields of successive 
pairs of a list are the elements of the list. For example, 
a two-element list is a pair whose car is the first element 
and whose cdr is a pair whose car is the second element 
and whose cdr is the empty list. The length of a list is the 
number of elements, which is the same as the number of 
pairs. 

The empty list is a special object of its own type (it is not 
a pair); it has no elements and its length is zero. 

The most general notation (external representation) for 
Scheme pairs is the "dotted" notation (ci . 03) where 
ex is the value of the car field and c? is the value of the 
cdr field. For example (4 . 5) is a pair whose car is 4 and 
whose cdr is 5. Note that (4 . 5) is the external repre- 
sentation of a pair, not an expression that evaluates to a 
pair. 
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A more streamlined notation can be used for lists: the 
elements of the list are simply enclosed in parentheses and 
separated by spaces. The empty list is written () . For 
example, 



(a b c d e) 



and 

(a . (b . (c . (d . (e . ()))))) 
are both representations of the same list of symbols. 

A chain of pairs not ending in the empty list is called an 
improper list. Note that an improper list is not a list. 
The list and dotted notations can be combined to represent 
improper lists: 

(a b c . d) 

is equivalent to 

(a . (b . (c . d))) 

Whether a given pair is a list depends upon what is stored 
in the cdr field. When the set-cdr! procedure is used, an 
object can be a list one moment and not the next: 



(define x (list 'a *b 'c)> 
(define y x) 



(set-cdr I x 

x 

(eqv? x y) 

y 



4) 



=> (a b c) 

=> unspecified 

=> (a . 4) 

=> #t 

=> (a . 4) 



It is often convenient to speak of a homogeneous list of ob- 
jects of some particular data type, as for example (1 2 3) 
is a list of integers. To be more precise, suppose D is some 
data type. (Any predicate defines a data type consisting 
of those objects of which the predicate is true.) Then 

• The empty list is a list of D. 

• If list is a list of D, then any pair whose cdr is list and 
whose car is an element of the data type D is also a 
list of D. 

• There are no other lists of D. 

Within literal expressions and representations of ob- 
jects read by the read procedure, the forms '(datum), 
s (datum), .(datum), and , C(datum) denote two-ele- 
ment lists whose first elements are the symbols quote, 
quasiquote, unquote, and unquote-splicing, respec- 
tively. The second element in each case is (datum). This 
convention is supported so that arbitrary Scheme pro- 
grams may be represented as lists. That is, according 
to Scheme's grammar, every (expression) is also a (datum) 
(see section 7.1.2). Among other things, this permits the 
use of the read procedure to parse Scheme programs. See 
section 3.3. 



(pair? obj) essential procedure 

Pair? returns #t if obj is a pair, and otherwise returns #f . 



(pair? '(a . b)) 
(pair? '(a b c)) 
(pair? '()) 
(pair? *#(a b)) 



#t 
*t 
#f 
#f 



(cons obji obji) essential procedure 

Returns a newly allocated pair whose car is obji and whose 
cdr is 06/2- The pair is guaranteed to be different (in the 
sense of eqv?) from every existing object. 



(cons 


'a •()) 


=*► 


(a) 


(cons 


•(a) "(b c d)) 


=» 


((a) bed) 


(cons 


"a" *(b c)) 


=*■ 


("a" b c) 


(cons 


'a 3) 


=>• 


(a . 3) 


(cons 


'(ib) *c) 


=> 


((a b) . c) 



(car pair) essential procedure 

Returns the contents of the car field of pair. Note that it 
is an error to take the car of the empty list. 



(car '(a b c)) 
(car '((a) b c d)) 
(car g (l . 2)) 
(car '()) 



(a) 

1 

error 



(cdr pair) essential procedure 

Returns the contents of the cdr field of pair. Note that it 
is an error to take the cdr of the empty list. 



(cdr *((a) b c d)) 
(cdr '(1 . 2)) 
(cdr '()) 



(b c 
2 

error 



d) 



(set-car! pair obj) essential procedure 

Stores obj in the car field of pair. The value returned by 
set-car! is unspecified. 

(set-cdr! pair obj) essential procedure 

Stores obj in the cdr field of patr. The value returned by 
set-cdr! is unspecified. 



(caar pair) 
(cadr pair) 

(edddar pair) 
(eddddr pair) 



essential procedure 
essential procedure 

essential procedure 
essential procedure 



These procedures are compositions of car and cdr, where 
for example caddr could be defined by 



16 Revised 3 Scheme 



(define caddr (lambda (x) (car (cdr (cdr x))))). 

Arbitrary compositions, up to four deep, are provided. 
There are twenty-eight of these procedures in all. 



(null? obj) essential procedure 

Returns #t if obj is the empty list, otherwise returns #f . 
(In implementations in which the empty list is the same as 
#f , null? will return #t if obj is #f .) 



(list obj ...) 

Returns a list of its arguments. 

(list "a (+ 3 4) *c) 
(list) 

(length list) 

Returns the length of list. 

(length '(a b c)) 
(length '(a (b) (c d e))) 
(length '()) 



(append listi /ts^) 
(append list ...) 



essential procedure 



(a 7 c) 




essential procedure 



3 
3 




essential procedure 
procedure 



Returns a list consisting of the elements of the first list 
followed by the elements of the other lists. 



(append ' (x) '(y)) 
(append '(a) ' (b c d)) 
(append '(a (b)) '((c))) 



(x y) 
(abed) 
(a (b) (c)) 



The resulting list is always newly allocated, except that 
it shares structure with the last list argument. The last 
argument may actually be any object; an improper list 
results if it is not a proper list. 



(append * (a b) ' (c 
(append ' () *a) 



d)) =>• (a b c . d) 



(reverse list) 



procedure 

Returns a newly allocated list consisting of the elements of 
list in reverse order. 

(reverse '(a b c)) =*• (c b a) 

(reverse '(a (b c) d (e (1)))) 

=> ((e (f)) d (b c) a) 



(list-tail list k) procedure 

Returns the sublist of list obtained by omitting the first k 
elements. List-tail could be defined by 



(define list-tail 
(lambda (x k) 
(if (zero? k) 



(list-tail (cdr x) (- k 1))))) 



(list-ref list k) procedure 

Returns the kth. element of list. (This is the same as the 
car of (list-tail list k).) 

(list-ref ' (a b c d) 2) =^- c 



(last-pair list) procedure 

Returns the last pair in the nonempty, possibly improper, 
list list. Last-pair could be defined by 

(define last-pair 
(lambda (x) 

(if (pair? (cdr x)) 

(last-pair (cdr x)) 
x))) 



(memq obj list) essential procedure 

(memv obj list) essential procedure 

(member obj list) essential procedure 

These procedures return the first sublist of list whose car 
is obj. If obj does not occur in list, #f (n.b.: not the empty 
list) is returned. Memq uses eq? to compare obj with the 
elements of list, while memv uses eqv? and member uses 
equal?. 

(memq 'a '(a b c)) => (a b c) 

(memq *b *(a b c)) =*• (b c) 

(memq 'a '(be d)) =» #f 

(memq (list *a) • (b (a) c)) ==► #f 
(member (list *a) 

"(b (a) c)) => ((a) c) 

(memq 101 '(100 101 102)) =>■ unspecified 

(memv 101 '(100 101 102)) => (101 102) 



essential procedure 
essential procedure 
essential procedure 



(assq obj alist) 
(assv obj alist) 
(assoc obj alist) 

Alist (for "association list") must be a list of pairs. These 
procedures find the first pair in alist whose car field is obj, 
and returns that pair. If no pair in alist has obj as its car, 
#f is returned. Assq uses eq? to compare obj with the car 
fields of the pairs in alist, while assv uses eqv? and assoc 
uses equal?. 

(define e '((a 1) (b 2) (c 3))) 
(assq 'a e) =*■ (a 1) 

(assq 'b e) => (b 2) 

(assq *d e) =*• #f 

(assq (list 'a) '(((a)) ((b)) ((c)))) 
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==> #f 
(assoc (list "a) '(((a)) ((b)) ((c)))) 

=» ((a)) 
(assq 5 '((2 3) (6 7) (11 13))) 

=^ unspecified 
(assv 6 '((2 3) (5 7) (11 13))) 

=> (6 7) 

Note: Although they are ordinarily used as predicates, memq, 
memv, member, assq, assv, and assoc do not have question 
marks in their names because they return useful values rather 
than just #t or #f . 

6.4. Symbols 

Symbols are objects whose usefulness rests on the fact that 
two symbols are identical (in the sense of eqv?) if and only 
if their names are spelled the same way. This is exactly the 
property needed to represent identifiers in programs, and 
so most implementations of Scheme use them internally for 
that purpose. Symbols are useful for many other applica- 
tions; for instance, they may be used the way enumerated 
values are used in Pascal. 

The rules for writing a symbol are exactly the same as the 
rules for writing an identifier; see sections 2.1 and 7.1.1. 

It is guaranteed that any symbol that has been returned as 
part of a literal expression, or read using the read proce- 
dure, and subsequently written out using the write proce- 
dure, will read back in as the identical symbol (in the sense 
of eqv?). The 8tring->symbol procedure, however, can 
create symbols for which this write/read invariance may 
not hold because their names contain special characters or 
letters in the non-standard case. 

Note: Some implementations of Scheme have a feature known 
as "slashification" in order to guarantee write/read invariance 
for all symbols, but historically the most important use of this 
feature has been to compensate for the lack of a string data 
type. 

Some implementations also have "uninterned symbols", which 
defeat write/read invariance even in implementations with 
slashification, and also generate exceptions to the rule that two 
symbols are the same if and only if their names are spelled the 
same. 



expression (section 4.1.2) or by a call to the read proce- 
dure, and its name contains alphabetic characters, then the 
string returned will contain characters in the implementa- 
tion's preferred standard case— some implementations will 
prefer upper case, others lower case. If the symbol was re- 
turned by string->symbol, the case of characters in the 
string returned will be the same as the case in the string 
that was passed to string->symbol. It is an error to apply 
mutation procedures like string-set ! to strings returned 
by this procedure. 

The following examples assume that the implementation's 
standard case is lower case: 



(symbol->string 'flying-fish) 

(symbol->string 'Martin) =* 
(symbol->string 

(string->symbol "Malvina")) 



"flying-fish" 

"martin" 



"Malvina" 



(string->symbol string) essential procedure 

Returns the symbol whose name is string. This procedure 
can create symbols with names containing special charac- 
ters or letters in the non-standard case, but it is usually 
a bad idea to create such symbols because in some imple- 
mentations of Scheme they cannot be read as themselves. 
See symbol->8tring. 

The following examples assume that the implementation's 
standard case is lower case: 

(eq? 'BlSSISSIppi "mississippi) 

==► #t 
(string->symbol "mISSISSIppi") 

=> the symbol with name "mISSISSIppi" 
(eq? 'bitBlt (string->symbol "bitBlt")) 

=*■ #f 
(eq? 'JollyWog 

(string->symbol 

(symbol->string 'JollyWog))) 
=» #t 
(string"? "K. Harper. M.D." 
(symbol->8tring 

(string->symbol "K. Harper, M.D."))) 
==*• #t 



(symbol? obj) essential procedure 

Returns #t if obj is a symbol, otherwise returns #f . 



(symbol? 'foo) 
(symbol? (car '(a b))) 
(symbol? "bar") 



#t 
#t 
#x 



(symbol->8tring symbol) essential procedure 

Returns the name of symbol as a string. If the symbol 
was part of an object returned as the value of a literal 



6.5. Numbers 

Numerical computation has traditionally been neglected by 
the Lisp community. Until Common Lisp there has been 
no carefully thought out strategy for organizing numeri- 
cal computation, and with the exception of the MacLisp 
system [28] there has been little effort to execute numeri- 
cal code efficiently. We applaud the excellent work of the 
Common Lisp committee and we accept many of their rec- 
ommendations. In some ways we simplify and generalize 
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their proposals in a manner consistent with the purposes 
of Scheme. 

Scheme's numerical operations treat numbers as abstract 
data, as independent of their representation as is possible. 
Thus, the casual user should be able to write simple pro- 
grams without having to know that the implementation 
may use fixed-point, floating-point, and perhaps other rep- 
resentations for his data. Unfortunately, this illusion of 
uniformity can be sustained only approximately — the im- 
plementation of numbers will leak out of its abstraction 
whenever the user must be in control of precision, or accu- 
racy, or when he must construct especially efficient com- 
putations. Thus the language must also provide escape 
mechanisms so that a sophisticated programmer can exer- 
cise more control over the execution of his code and the 
representation of his data when necessary. 

It is important to distinguish between the abstract num- 
bers, their machine representations, and their written rep- 
resentations. We will use mathematical terms number, 
complex, real, rational, and integer for properties of 
the abstract numbers, the names f ixnum, bignum, ratnum, 
and f lonum for machine representations, and the names 
int, fix, f lo, sci, rat, polar, and rect for input/output 
formats. 

6.5.1. Numbers 

A Scheme system provides data of type number, which is 
the most general numerical type supported by that sys- 
tem. Number is likely to be a complicated union type im- 
plemented in terms of f ixnums, bignums, f lonums, and so 
forth, but this should not be apparent to a naive user. 
What the user should see is that the usual operations 
on numbers produce the mathematically expected results, 
within the limits of the implementation. Thus if the user 
divides the exact number 3 by the exact number 2, he 
should get something like 1.5 (or the exact fraction 3/2). 
If he adds that result to itself, and the implementation is 
good enough, he should get an exact 3. 

Mathematically, numbers may be arranged into a tower of 
subtypes with projections and injections relating adjacent 
levels of the tower: 

number 

complex 

real 

rational 

integer 

We impose a uniform rule of downward coercion — a num- 
ber of one type is also of a lower type if the injection (up) 
of the projection (down) of a number leaves the number 
unchanged. Since this tower is a genuine mathematical 
structure, Scheme provides predicates and procedures to 
access the tower. 



Not all implementations of Scheme must provide the whole 
tower, but they must implement a coherent subset consis- 
tent with both the purposes of the implementation and the 
spirit of the Scheme language. 

6.5.2. Exactness 

Numbers are either exact or inexact. A number is ex- 
act if it was derived from exact numbers using only exact 
operations. A number is inexact if it models a quantity 
(e.g., a measurement) known only approximately, if it was 
derived using inexact ingredients, or if it was derived us- 
ing inexact operations. Thus inexactness is a contagious 
property of a number. Some operations, such as the square 
root (of non-square numbers) , must be inexact because of 
the finite precision of our representations. Other opera- 
tions are inexact because of implementation requirements. 
We emphasize that exactness is independent of the posi- 
tion of the number on the tower. It is perfectly possible to 
have an inexact integer or an exact real; 355/113 may 
be an exact rational or it may be an inexact rational 
approximation to pi, depending on the application. 

Operationally, it is the system's responsibility to combine 
exact numbers using exact methods, such as infinite pre- 
cision integer and rational arithmetic, where possible. An 
implementation may not be able to do this (if it does not 
use infinite precision integers and rationals), but if a num- 
ber becomes inexact for implementation reasons there is 
likely to be an important error condition, such as integer 
overflow, to be reported. Arithmetic on inexact numbers 
is not so constrained. The system may use floating point 
and other ill-behaved representation strategies for inexact 
numbers. This is not to say that implementors need not 
use the best known algorithms for inexact computations — 
only that approximate methods of high quality are allowed. 
In a system that cannot explicitly distinguish exact from 
inexact numbers the system must do its best to main- 
tain precision. Scheme systems must not burden users 
with numerical operations described in terms of hardware 
and operating-system dependent representations such as 
f ixnum and f lonum, however, because these representation 
issues are hardly ever germane to the user's problems. 

We highly recommend that the IEEE 32-bit and 64-bit 
floating-point standards be adopted for implementations 
that use floating-point representations internally. To min- 
imize loss of precision we adopt the following rules: If an 
implementation uses several different sizes of floating-point 
formats, the results of any operation with a floating-point 
result must be expressed in the largest format used to ex- 
press any of the floating-point arguments to that operation. 
It is desirable (but not required) for potentially irrational 
operations such as sqrt, when applied to exact arguments, 
to produce exact answers whenever possible (for example 
the square root of an exact 4 ought to be an exact 2). If 
an exact number (or an inexact number represented as 
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a f ixnum, a bigimm, or a ratnum) is operated upon so as 
to produce an inexact result (as by sqrt), and if the re- 
sult is represented as a f lonum, then the largest available 
f lonum format must be used; but if the result is expressed 
as a ratnum then the rational approximation must have at 
least as much precision as the largest available f lonum. 

6.5.3. Number syntax 

Scheme allows the traditional ways of writing numerical 
constants, though any particular implementation may sup- 
port only some of them. These syntaxes are intended to 
be purely notational; any kind of number may be written 
in any form that the user deems convenient. Of course, 
writing 1/7 as a limited-precision decimal fraction will not 
express the number exactly, but this approximate form of 
expression may be just what the user wants to see. 

The syntax of numbers is described formally in sec- 
tion 7.1.1. See section 6.5.6 for many examples of rep- 
resentations of numbers. 

A numerical constant may be represented in binary, octal, 
decimal, or hex by the use of a radix prefix. The radix 
prefixes are #b (binary), #o (octal), #d (decimal), and #x 
(hex). With no radix prefix, a number is assumed to be 
expressed in decimal. 

A numerical constant may be specified to be either exact 
or inexact by a prefix. The prefixes are #e for exact, and 
#i for inexact. An exactness prefix may appear before or 
after any radix prefix that is used. If the representation of 
a numerical constant has no exactness prefix, the constant 
may be assumed to be exact or inexact at the discretion of 
the implementation, except that integers expressed without 
decimal points and without use of exponential notation are 
assumed exact. 

In systems with both single and double precision f lonums 
we may want to specify which size we want to use to rep- 
resent a constant internally. For example, we may want 
a constant that has the value of pi rounded to the single 
precision length, or we might want a long number that has 
the value 6/10. In either case, we are specifying an explicit 
way to represent an inexact number. For this purpose, we 
may express a number with a prefix that indicates short or 
long f lonum representation: 

#S3. 14169266358979 

Round to short — 3.141693 
#L.6 

Extend to long — .600000000000000 

6.5.4. Numerical operations 

The reader is referred to section 1.3.3 for a summary of 
the naming conventions used to specify restrictions on the 
types of arguments to numerical routines. 



(number? obj) 
(complex? obj) 
(real? obj) 
(rational? obj) 
(integer? obj) 



essential 
essential 
essential 
essential 
essential 



procedure 
procedure 
procedure 
procedure 
procedure 



These numerical type predicates can be applied to any kind 
of argument, including non-numbers. They return true if 
the object is of the named type. In general, if a type pred- 
icate is true of a number then all higher type predicates 
are also true of that number. Not every system supports 
all of these types; for example, it is entirely possible to 
have a Scheme system that has only integers. Nonethe- 
less every implementation of Scheme must have all of these 
predicates. 



(zero? z) 

(positive? x) 
(negative? as) 
(odd? n) 
(even? n) 
(exact? z) 
(inexact? z) 



essential 
essential 
essential 
essential 
essential 
essential 
essential 



procedure 
procedure 
procedure 
procedure 
procedure 
procedure 
procedure 



These numerical predicates test a number for a particular 
property, returning #t or #f . 



(- Z\ z 2 ) 
(< Xx x 2 ) 
(> xi x 2 ) 
(<- Xl x 2 ) 
(>- Xi x 2 ) 



essential procedure 
essential procedure 
essential procedure 
essential procedure 
essential procedure 



Some implementations allow these procedures to take many 
arguments, to facilitate range checks. These procedures 
return #t if their arguments are (respectively): numeri- 
cally equal, monotonically increasing, monotonically de- 
creasing, monotonically nondecreasing, or monotonically 
nonincreasing. Warning: on inexact numbers the equal- 
ity tests will give unreliable results, and the other numer- 
ical comparisons will be Useful only heuristically; when in 
doubt, consult a numerical analyst. 

(max xi x 2 ) essential procedure 

(max Xi x 2 .. .) procedure 

(min xx x 2 ) essential procedure 

(min xi x 2 .. .) procedure 

These procedures return the maximum or minimum of 
their arguments. 

(+ z\ z 2 ) essential procedure 

(+ z\ ...) procedure 

(* z\ z 2 ) essential procedure 

(* z\ ...) procedure 

These procedures return the sum or product of their argu- 
ments. 
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(+ 3 4) 
(+ 3) 
(+) 

(* 4) 
(*) 



7 
3 


4 

1 



(numerator q) 
(denominator q) 



procedure 
procedure 



(- z\ za) essential procedure 

(- z\ z-i . . . ) procedure 

(/ z\ 22) essential procedure 

(/ z\ z<2 . . • ) procedure 

With two or more arguments, these procedures return the 
difference or quotient of their arguments, associating to the 
left. With one argument, however, they return the additive 
or multiplicative inverse of their argument. 

(-3 4) =*• -1 

(-3 4 6) =► -6 

(- 3) => -3 

(/ 3 4 6) => 3/20 

(/ 3) => 1/3 

(abs z) essential procedure 

Abs returns the magnitude of its argument. 

(abs -7) ==» 7 

(abs -3+41) =>• 5 

(quotient m n.2) essential procedure 

(remainder n\ ris) essential procedure 

(modulo rii ri2) procedure 

These are intended to implement number-theoretic (inte- 
ger) division: For positive integers rii and n.2, if n.3 and r»4 
are integers such that Hi = ri2»i3 + rn and < n* < n-2, 
then 

(quotient tu nj) ==>• 113 

(remainder »i »a) => n* 

(modulo rii ftz) => »4 

For all integers ni and n-2 with n.2 not equal to 0, 

(= »i (+ (* na (quotient »i n 2 )) 
(remainder ttx n.2))) 

=> #t 

The value returned by quotient always has the sign of the 
product of its arguments. Remainder and modulo differ on 
negative arguments — the remainder always has the sign of 
the dividend, the modulo always has the sign of the divisor: 

(modulo 13 4) => 1 

(remainder 13 4) => 1 



(modulo -13 4) 
(remainder -13 4) 

(modulo 13 -4) 
(remainder 13 -4) 

(modulo -13 -4) 

(remainder -13 -4) 



=► 3 

=* -1 

=» -3 

=> 1 

==> -1 



These procedures return the numerator or denominator of 
their argument. 



(numerator (/ 6 4)) 
(denominator (/ 6 4)) 



(gcd nx . . . ) procedure 

(lcm ni . . . ) procedure 

These procedures return the greatest common divisor or 
least common multiple of their arguments. The result is 
always non-negative. 



(gcd 32 -36) 

(gcd) 

(lcm 32 -36) 

(lcm) 








4 


288 
1 




(floor x) 
(ceiling x) 
(truncate x) 
(round x) 
(rationalize x 
(rationalize x) 


y) 








procedure 
procedure 
procedure 
procedure 
procedure 
procedure 


These procedures create integers and rationals. Their re- 
sults are exact if and only if their arguments are exact. 



Floor returns the largest integer not larger than x. Ceil- 
ing returns the smallest integer not smaller than x. Trun- 
cate returns the integer of maximal absolute value not 
larger than the absolute value of x. Round returns the 
closest integer to x, rounding to even when x is halfway 
between two integers. With two arguments, rationalize 
produces the rational number with smallest denominator 
differing from x by no more than y. With one argument, 
rationalize produces the best rational approximation to 
x, preserving all of the precision in its representation. 

Note: Round rounds to even for consistency with the rounding 
modes required by the IEEE floating point standard. 



(exp z) 
(log z) 
(sin z) 
(cos z) 
(tan z) 
(asin z) 
(acos 0) 
(atan z) 
(atan y x) 



procedure 
procedure 
procedure 
procedure 
procedure 
procedure 
procedure 
procedure 
procedure 



These procedures are part of every implementation that 
supports real numbers; they compute the usual transcen- 
dental functions. Log computes the natural logarithm of 
z (not the base 10 logarithm). Asin, acos, and atan 
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compute arcsine (sin -1 ), arccosine (cos -1 ), and arctan- 
gent (tan -1 ), respectively. The two-argument variant of 
atan computes (angle (make -rectangular x {/)) (see 
below), even in implementations that don't support com- 
plex numbers. 

In general, the mathematical functions log, arcsine, arcco- 
sine, and arctangent are multiply defined. For nonzero real 
x, the value of log x is defined to be the one whose imagi- 
nary part lies in the range — ir (exclusive) to k (inclusive). 
logO is undefined. The value of log 2 when z is complex is 
defined according to the formula 

logz = log magnitude (z) + i angle(z) 

With log defined this way, the values of sin 

and tan -1 z are according to the following formulae: 

-1 z = — i logo's + y/l — z 2 ) 

z = —i log(z + 1 i v 1 



z, cos 1 z, 



sin 



cos 



-1 



2 ) 



tan -1 z = -i log((l + iz)y/l/{l +* 2 )) 

The above specification follows [43], which in turn fol- 
lows [26]; refer to these sources for more detailed discussion 
of branch cuts, boundary conditions, and implementation 
of these functions. 



(sqrt z) procedure 

Returns the principal square root of z. The result will have 
either positive real part, or zero real part and non-negative 
imaginary part. 



(expt z\ Z2) 

Returns Z\ raised to the power z^: 



procedure 



Zl z > = e * alo s* 1 



0° is defined to be equal to 1. 

(make-rectangular x% X2) procedure 

(make -polar X3 X4) procedure 

(real-part z) procedure 

(imag-part z) procedure 

(magnitude z) procedure 

(angle z) procedure 

These procedures are part of every implementation that 
supports complex numbers. Suppose x\, X2, X3, and X4 
are real numbers and z is a complex number such that 

z = xi + x%i = x$ • e lXi 

Then make-rectangular and make-polar return z, real- 
part returns xi, imag-part returns 12, magnitude returns 
X3, and angle returns £4. In the case of angle, whose value 
is not uniquely determined by the preceding rule, the value 
returned will be the one in the range — tt (exclusive) to it 
(inclusive). 



Note: Magnitude is the same as abs, but aba must be present 
in all implementations, whereas aagnitude will only be present 
in implementations that support complex numbers. 



(exact->inexact z) 

(inexact->exact z) 



procedure 
procedure 



Exact->inexact returns an inexact representation of 2, 
which is a fairly harmless thing to do. Inexact->exact 
returns an exact representation of z. Since the law of 
"garbage in, garbage out" remains in force, inexact-> 
exact should not be used casually. 

6.5.5. Numerical input and output 



(number->string number format) procedure 

The conventions used to produce the printed representa- 
tion of a number can be specified by a format, as described 
in section 6.5.6. The procedure number->string takes a 
number and a format and returns as a string the printed 
representation of the given number in the given format. 
This procedure will mostly be used by sophisticated users 
and in system programs. In general, a naive user will need 
to know nothing about the formats because the system 
printer will have reasonable default formats for all types of 
numbers. 



(string->number string exactness radix) procedure 

The system reader will construct reasonable default numer- 
ical types for numbers expressed in each of the formats it 
recognizes. A user who needs control of the coercion from 
strings to numbers will use string->number. Exactness 
must be a symbol, either E (for exact) or I (for inexact). 
Radix must also be a symbol: B for binary, for octal, D for 
decimal, and X for hexadecimal. Returns a number of the 
maximally precise representation expressed by the given 
string. It is an error if string does not express a number 
according to the grammar in section 7.1.1. 

6.5.6. Formats 

A format is a list beginning with a format descriptor, which 
is a symbol such as sci. Following the descriptor are pa- 
rameters used by that descriptor, such as the number of 
significant digits to be used. Default values are supplied 
for any parameters that are omitted. Modifiers may appear 
after the parameters, such as the radix and exactness for- 
mats described below, which themselves take parameters. 

Details of particular formats such as sci and fix are given 
in section 6.5.7. 

For example, the format (sci 5 2 (exactness s)) spec- 
ifies that a number is to be expressed in Fortran scien- 
tific format with 5 significant places and two places after 
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(he radix point, and that its exactness prefix is to be sup- 
pressed. 

In the following examples, the comment shows the format 
that was used to produce the output shown: 



6023800######*##*####### 



123 +123 -123 

123466789012346678901234567 

356/113 +366/113 -366/113 

+123.45 -123.46 

3.14169266368979 

3.14169266368979 

123.460 

-123.456-1 

123e3 123e-3 -123e-3 

-1+21 

1.281.570796 



(int) 
(int) 
(rat) 
(fix 2) 
(fix 14) 
(flo 16) 
(flo 6) 
(scl 6 2) 
(sci 3 0) 

(rect (int) (int)) 
(polar (fix 1) 
(flo 7)) 



A format may specify that a number should be expressed 
in a particular radix. The radix prefix may also be sup- 
pressed. For example, one may express a complex number 
in polar form with the magnitude in octal and the angle in 
decimal as follows: 



#ol.2fi#d 1.570796327 



#01.201.670796327 



(polar (flo 2 (radix o)) 

(flo (radix d))) 

(polar (flo 2 (radix o)) 

(flo (radix da))) 



A format may specify that a number should be expressed 
with an explicit exactness prefix ((exactness e)), or it 
may force the exactness to be suppressed ((exactness 
s)). For example, the following are ways to express an 
inexact value for pi: 



#1365/113 

355/113 

#i3.1416 



(rat (exactness e)) 
(rat (exactness a)) 
(fix 4 (exactness e)) 



An attempt to produce more digits than are available in 
the internal machine representation of a number will be 
marked with a a #" filling the extra digits. This is not a 
statement that the implementation knows or keeps track 
of the significance of a number, just that the machine will 
flag attempts to produce 20 digits of a number that has 
only 15 digits of machine representation: 



3.14158265358979##### 

6.5.7. Details of formats 
The format descriptors are: 



(flo 20 (exactness a)) 



(xat n) 



format 



Express as a rational fraction, n specifies the largest de- 
nominator to be used in constructing a rational approxi- 
mation to the number being expressed. If n is omitted it 
defaults to infinity. 



(fix n) 



format 



Express with a fixed radix point, n specifies the number 
of places to the right of the radix point, n defaults to the 
size of a single-precision f lonum. If there are not enough 
significant places, then insignificant digits will be flagged. 
For example, an inexact 6.0238 • 10 23 (represented inter- 
nally as a 7 digit f lonum) would be printed with a (fix 
2) format as 



6023800################# .## 



(flo n) 



format 



Express with a floating radix point, n specifies the total 
number of places to be displayed, n defaults to the size of 
a single-precision f lonum. If the number is out of range, 
it is converted to (sci). (flo h) expresses n in floating 
point format heuristically for human consumption. 



(sci n m) 



format 



Express in exponential notation, n specifies the total num- 
ber of places to be displayed, n defaults to the size of a 
single-precision f lonum. m specifies the number of places 
to the right of the radix point, m defaults to n — 1. (sci 
h) does heuristic expression. 



(rect r i) 



format 



Express as a rectangular form complex number, r and i 
are formats for the real and imaginary parts respectively. 
They default to (heur) . 

(polar m o) format 

Express as a polar form complex number, m and a are 
formats for the magnitude and angle respectively, m and 
a default to (heur) . 



(int) 



format (heur) 



Express as an integer. The radix point is implicit. If 
there are not enough significant places then insignificant 
digits will be flagged. For example, an inexact integer 
6.0238 • 10 23 (represented internally as a 7 digit f lonum) 
would be printed as 



format 



Express heuristically using the minimum number of digits 
required to get an expression that when coerced back to 
a number produces the original machine representation. 
Exact numbers are expressed as (int) or (rat). Inexact 
numbers are expressed as (flo h) or (sci h) depending 
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on their range. Complex numbers are expressed in (rect) . 
This is the normal default of the system printer. 

The following modifiers may be added to a numerical for- 
mat specification: 



(exactness s) 



format 



This controls the expression of the exactness prefix of a 
number, s must be a symbol, either E or S, indicating 
whether the exactness is to be expressed or suppressed, re- 
spectively. If no exactness modifier is specified for a format 
then the exactness is by default suppressed. 



(radix r s) 



format 



This forces a number to be expressed in the radix r. r may 
be the symbol B (binary), (octal), D (decimal), or X (hex). 
s must be a symbol, either E or S, indicating whether the 
radix prefix is to be expressed or suppressed, respectively. 
s defaults to E (expressed). If no radix modifier is specified 
then the default is decimal and the prefix is suppressed. 

6.6. Characters 

Characters are objects that represent printed characters 
such as letters and digits. There is no requirement that the 
data type of characters be disjoint from other data types; 
implementations are encouraged to have a separate char- 
acter data type, but may choose to represent characters as 
integers, strings, or some other type. 

Characters are written using the notation #\ (character) or 
#\ (character name). For example: 



#\a 
#\A 
#\( 
#\ 

#\space 
#\newline 



lower case letter 

upper case letter 

left parenthesis 

the space character 

the preferred way to write a space 

the newline character 



Case is significant in #\ (character), but not in #\ (character 
name). If (character) in #\(character) is alphabetic, then 
the character following (character) must be a delimiter 
character such as a space or parenthesis. This rule resolves 
the ambiguous case where, for example, the sequence of 
characters a #\space" could be taken to be either a repre- 
sentation of the space character or a representation of the 
character "#\s" followed by a representation of the symbol 
"pace." 

Characters written in the #\ notation are self-evaluating. 
That is, they do not have to be quoted in programs. The #\ 
notation is not an essential part of Scheme, however. Even 



implementations that support the #\ notation for input do 
not have to support it for output. 

Some of the procedures that operate on characters ignore 
the difference between upper case and lower case. The pro- 
cedures that ignore case have the suffix "-ci* (for "case in- 
sensitive"). If the operation is a predicate, then the "-ci* 
suffix precedes the "?* at the end of the name. 

(char? obj) essential procedure 

Returns #t if obj is a character, otherwise returns #f . 



(char=? chari char 2 ) 
(char<? chari chari) 
(char>? chari char 2 ) 
(char<-? chari cAar2) 
(char>»? chari char 2 ) 



essential procedure 
essential procedure 
essential procedure 
essential procedure 
essential procedure 



These procedures impose a total ordering on the set of 
characters. It is guaranteed that under this ordering: 

• The upper case characters are in order. For example, 
(char<? #\A #\B) returns #t. 

• The lower case characters are in order. For example, 
(char<? #\a #\b) returns #t. 

• The digits are in order. For example, (char<? #\0 
#\9) returns #t. 

• Either all the digits precede all the upper case letters, 
or vice versa. 

• Either all the digits precede all the lower case letters, 
or vice versa. 

Some implementations may generalize these procedures to 
take more than two arguments, as with the corresponding 
numeric predicates. 



(char-ci=? chari char 2 ) 
(char-ci<? chari char 2 ) 
(char-ci>? chari char 2 ) 
(char-ci<«? chari char 2 ) 
(char-ci>=? char x char 2 ) 



procedure 
procedure 
procedure 
procedure 
procedure 



These procedures are similar to char=? et cetera, but they 
treat upper case and lower case letters as the same. For 
example, (char-ci-? #\A #\a) returns #t. Some imple- 
mentations may generalize these procedures to take more 
than two arguments, as with the corresponding numeric 
predicates. 

(char-alphabetic? char) procedure 

(char-numeric? char) procedure 

(char-whitespace? cAar) procedure 

These procedures return #t if their arguments are alpha- 
betic, numeric, or whitespace characters, respectively, oth- 
erwise they return #f . The following remarks, which are 



24 Revised 3 Scheme 



specific to the ASCII character set, are intended only as 
a guide: The alphabetic characters are the 52 upper and 
lower case letters. The numeric characters are the ten dec- 
imal digits. The whitespace characters are space, tab, line 
feed, form feed, and carriage return. 



(char-upper-case? letter) 
(char-lower-case? letter) 



procedure 
procedure 



Letter must be an alphabetic character. These procedures 
return #t if their arguments are upper case or lower case 
characters, respectively, otherwise they return #f . 



(char->integer char) 
(integer->char n) 



essential procedure 
essential procedure 



Given a character, char->integer returns an integer rep- 
resentation of the character. Given an integer that is the 
image of a character under char->integer, integer-> 
char returns a character. These procedures implement or- 
der isomorphisms between the set of characters under the 
char<=? ordering and some subset of the integers under 
the <= ordering. That is, if 



(char<»? a b) 



#t and (<■ 



y) 



#t 



and x and y are in the range of char->integer, then 



(<* (char->integer a) 

(char->integer 6)) 

(char<=? (integer->char x) 
(integer->char y)) 



(char-upcase char) 
(char-downcase char) 



#t 



#t 



procedure 
procedure 



These procedures return a character char^ such that 
(char-ci=? char char^j. In addition, if char is alpha- 
betic, then the result of char-upcase is upper case and 
the result of char-downcase is lower case. 



6.7. Strings 

Strings are sequences of characters. In some implemen- 
tations of Scheme they are immutable; other implementa- 
tions provide destructive procedures such as string- set! 
that alter string objects. 

Strings are written as sequences of characters enclosed 
within doublequotes ("). A doublequote can be written 
inside a string only by escaping it with a backslash (\), as 
in 

"The word \"recursion\" has many meanings." 



A backslash can be written inside a string only by escaping 
it with another backslash. Scheme does not specify the 
effect of a backslash within a string that is not followed by 
a doublequote or backslash. 

A string may continue from one line to the next, but this is 
usually a bad idea because the exact effect may vary from 
one computer system to another. 

The length of a string is the number of characters that 
it contains. This number is a non-negative integer that 
is fixed when the string is created. The valid indexes of 
a string are the exact non-negative integers less than the 
length of the string. The first character of a string has 
index 0, the second has index 1, and so on. 

In phrases such as "the characters of string beginning with 
index start and ending with index end," it is understood 
that the index start is inclusive and the index end is ex- 
clusive. Thus if start and end are the same index, a null 
substring is referred to, and if start is zero and end is the 
length of string, then the entire string is referred to. 

Some of the procedures that operate on strings ignore the 
difference between upper and lower case. The versions that 
ignore case have the suffix "-ci" (for "case insensitive"). If 
the operation is a predicate, then the "-ci" suffix precedes 
the "?" at the end of the name. 

(string? obj) essential procedure 

Returns #t if obj is a string, otherwise returns #f . 

(make-string k) procedure 

(make-string k char) procedure 

k must be a non-negative integer, and char must be a char- 
acter. Make-string returns a newly allocated string of 
length k. If char is given, then all elements of the string 
are initialized to char, otherwise the contents of the string 
are unspecified. 

(string-length string) essential procedure 

Returns the number of characters in the given string. 

(string-ref string k) essential procedure 

k must be a valid index of string. String-ref returns 
character k of string using zero-origin indexing. 

(string-set! string k char) procedure 

k must be a valid index of string. String-set ! stores char 
in element fc of string and returns an unspecified value. 

(string=? stringi stringz) essential procedure 

(string-ci-? stringi string) procedure 

Returns #t if the two strings are the same length and con- 
tain the same characters in the same positions, otherwise 
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returns #f . String-ci-? treats upper and lower case let- 
ters as though they were the same character, but string=? 
treats upper and lower case as distinct characters. 



(string<? stringi stringi) 
(string>? stringi string2) 
(string<=? stringi s<rtnjr 2 ) 
(string^? stringi string2) 
(string-ci<? stringi string2) 
(string-ci>? stringi string?) 
(string-ci<=? stringi string 2 ) 
(string-ci>=? stringi string2) 



essential procedure 

essential procedure 

essential procedure 

essential procedure 

procedure 

procedure 

procedure 

procedure 



These procedures are the lexicographic extensions to 
strings of the corresponding orderings on characters. For 
example, string<? is the lexicographic ordering on strings 
induced by the ordering char<? on characters. If two 
strings differ in length but are the same up to the length 
of the shorter string, the shorter string is considered to be 
lexicographically less than the longer string. 

Implementations may generalize these and the string"? 
and string-ci»? procedures to take more than two argu- 
ments, as with the corresponding numeric predicates. 

(substring string start end) essential procedure 

String must be a string, and start and end must be exact 
integers satisfying 

< start < end < (string- length string). 
Substring returns a newly allocated string formed from 
the characters of string beginning with index start (inclu- 
sive) and ending with index end (exclusive). 

(string-append stringi string2) essential procedure 

(string-append string ...) procedure 

Returns a new string whose characters form the concate- 
nation of the given strings. 

(string->list string) essential procedure 

(list->string chars) essential procedure 

String->list returns a newly allocated list of the char- 
acters that make up the given string. List->string re- 
turns a string formed from the characters in the list chars. 
String->list and list->string are inverses so far as 
equal? is concerned. Implementations that provide de- 
structive operations on strings should ensure that the re- 
sult of list->string is newly allocated. 

(string-copy string) procedure 

Returns a newly allocated copy of the given string. 

(string-fill! string char) procedure 

Stores char in every element of the given string and returns 
an unspecified value. 



6.8. Vectors 

Vectors are heterogenous mutable structures whose ele- 
ments are indexed by integers. 

The length of a vector is the number of elements that it 
contains. This number is a non-negative integer that is 
fixed when the vector is created. The valid indexes of a 
vector are the exact non-negative integers less than the 
length of the vector. The first element in a vector is indexed 
by zero, and the last element is indexed by one less than 
the length of the vector. 

Vectors are written using the notation #(06/ ...). For 
example, a vector of length 3 containing the number zero 
in element 0, the list (2 2 2 2) in element 1, and the 
string "Anna" in element 2 can be written as following: 

#(0 (2 2 2 2) "Anna") 

Note that this is the external representation of a vector, not 
an expression evaluating to a vector. Like list constants, 
vector constants must be quoted: 

'#(0 (2 2 2 2) "Anna") 

=> #(0 (2 2 2 2) "Anna") 

(vector? obj) essential procedure 

Returns #t if obj is a vector, otherwise returns #f . 

(make-vector k) essential procedure 

(make-vector ifc fill) procedure 

Returns a newly allocated vector of Jb elements. If a second 
argument is given, then each element is initialized to fill. 
Otherwise the initial contents of each element is unspeci- 
fied. 

(vector obj ...) essential procedure 

Returns a newly allocated vector whose elements contain 
the given arguments. Analogous to list. 

(vector 'a 'b *c) =*■ #(a b c) 

(vector-length vector) essential procedure 

Returns the number of elements in vector. 

(vector-ref vector k) essential procedure 

A: must be a valid index of vector. Vector-ref returns the 
contents of element k of vector. 

(vector-ref '#(1 1 2 3 6 8 13 21) 6) =*> 8 

(vector-set! vector k obj) essential procedure 

k must be a valid index of vector. Vector-set ! stores obj 
in element k of vector. The value returned by vector-set ! 
is unspecified. 
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(let ((vec (vector '(2222) "Anna"))) 
(vector-set! vec i '("Sue" "Sue")) 
vec) 

=*• #(0 ("Sue" "Sue") "Anna") 



(vector->list vector) 
(list->vector list) 



essential procedure 
essential procedure 

Vector->list returns a newly created list of the objects 
contained in the elements of vector. List->vector returns 
a newly created vector initialized to the elements of the list 

list. 

(vector-Mist '#(dah dan didah)) 

=*• (dan dan didah) 
(list->vector '(dididit dah)) 

=> # (dididit dah) 



(vector-fill I vector fill) procedure 

Stores fill in every element of vector. The value returned 
by vector-fill! is unspecified. 

6.9. Control features 

This chapter describes various primitive procedures which 
control the flow of program execution in special ways. The 
procedure? predicate is also described here. 

(procedure? obj) essential procedure 

Returns #t if obj is a procedure, otherwise returns #f . 

(procedure? car) ==>• #t 

(procedure? 'car) => #f 

(procedure? (lambda (x) (* x x))) 

=> #t 
(procedure? '(lambda (x) (* x x))) 

==> #f 
(call-with-current-continuation procedure?) 

#t 



(apply proc args) essential procedure 

(apply proc argi . . . args) procedure 

Proc must be a procedure and args must be a list. The 
first (essential) form calls proc with the elements of args as 
the actual arguments. The second form is a generalization 
of the first that calls proc with the elements of the list 
(append (list argi • • • ) o.rgs) as the actual arguments. 

(apply + (list 3 4)) => 7 

(define compose 
(lambda (f g) 
(lambda args 

(f (apply g args))))) 



(map proc list) essential procedure 

(map proc listi list<2 ...) procedure 

The lista must be lists, and proc must be a procedure taking 
as many arguments as there are lists. If more than one list 
is given, then they must all be the same length. Map applies 
proc element-wise to the elements of the lists and returns 
a list of the results. The order in which proc is applied to 
the elements of the lists is unspecified. 

(map cadr '((a b) (d e) (g h))) 
=> (b e h) 

(map (lambda (n) (expt n n)) 
'(12 3 4 5)) 

=► (1 4 27 256 3126) 

(map + '(1 2 3) '(4 5 6)) ==» (B 7 9) 

(let ((count 0)) 

(map (lambda (ignored) 

(set! count (+ count 1)) 
count) 
'(a b c))) => unspecified 



(f or -each proc list) 
(for-each proc list\ Ztsfe •••) 



essential procedure 
procedure 



((compose sqrt *) 12 76) 
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The arguments to for-each are like the arguments to map, 
but for-each calls proc for its side effects rather than for 
its values. Unlike map, for-each is guaranteed to call proc 
on the elements of the lista in order from the first element to 
the last, and the value returned by for-each is unspecified. 

(let ((v (make-vector 6))) 
(for-each (lambda (i) 

(vector-set! v i (* i i))) 
'(01234)) 

v) => 1(0149 16) 



(force promise) procedure 

Forces the value of promise (see delay, section 4.2.5). If 
no value has been computed for the promise, then a value 
is computed and returned. The value of the promise is 
cached (or "memoized" ) so that if it is forced a second 
time, the previously computed value is returned without 
any recomputation. 

(force (delay (+12))) ==> 3 

(let ((p (delay (+ 1 2)))) 
(list (force p) (force p))) 

=► (3 3) 

(define a-stream 
(letrec ((next 

(lambda (n) 

(cons n (delay (next (+ n 1))))))) 
(next 0))) 
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(define head car) 
(define tail 

(lambda (stream) (force (cdr stream)))) 

(head (tail (tail a-stream))) 



Force and delay are mainly intended for programs writ- 
ten in functional style. The following examples should not 
be considered to illustrate good programming style, but 
they illustrate the property that the value of a promise is 
computed at most once. 



(define count 0) 






(define p 


(delay (begin 


(set! count (+ count 1)) 






(* 


x 3)))) 


(define x 


6) 






count 






=*• 


P 






=> a promise 


(force p) 






=>• 16 


P 






=>■ a promise, still 


count 






=>> 1 


(force p) 






=*> IB 


count 






=> 1 



Here is a possible implementation of delay and force. We 
define the expression 

(delay (expression)) 
to have the same meaning as the procedure call 

(make-promise (lambda () (expression})), 

where make-promise is defined as follows: 

(define make-promise 
(lambda (proc) 

(let ((already-run? #f) (result #f)) 
(lambda () 

(cond ((not already-run?) 

(set! result (proc)) 
(set! already-run? #t))) 
result)))) 

Promises are implemented here as procedures of no argu- 
ments, and force simply calls its argument. 

(define force 

(lambda (object) 
(object))) 

Various extensions to this semantics of delay and force 
are supported in some implementations: 

• Calling force on an object that is not a promise may 
simply return the object. 

• It may be the case that there is no means by which 
a promise can be operationally distinguished from its 
forced value. That is, expressions like the following 
may evaluate to either #t or to #f , depending on the 
implementation: 



(eqv? (delay i) 1) => unspecified 

(pair? (delay (cons 1 2))) => unspecified 

Some implementations will implement "implicit forc- 
ing," where the value of a promise is forced by primi- 
tive procedures like cdr and +: 



(+ (delay (* 3 7)) 13) 
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(call-with-current-continuation proc) 

essential procedure 

Proc must be a procedure of one argument. The procedure 
call-with-current-continuation packages up the cur- 
rent continuation (see the rationale below) as an "escape 
procedure" and passes it as an argument to proc. The 
escape procedure is a Scheme procedure of one argument 
that, if it is later passed a value, will ignore whatever con- 
tinuation is in effect at that later time and will give the 
value instead to the continuation that was in effect when 
the escape procedure was created. 

The escape procedure created by call-with-current- 
continuation has unlimited extent just like any other pro- 
cedure in Scheme. It may be stored in variables or data 
structures and may be called as many times as desired. 

The following examples show only the most common uses of 
call-with-current-continuation. If all real programs 
were as simple as these examples, there would be no need 
for a procedure with the power of call-with-current- 
continuation. 

(call-with-current-continuation 
(lambda (exit) 

(for-each (lambda (x) 

(if (negative? x) 
(exit x))) 
'(64 37 -3 246 19)) 
#t)) =*• -3 

(define list-length 
(lambda (obj) 

(call-with-current-continuation 
(lambda (return) 
(letrec ((r 

(lambda (obj) 

(cond ((null? obj) 0) 
((pair? obj) 

(+ (r (cdr obj)) 1)) 
(else (return «f )))))) 
(r obj)))))) 



(list-length '(1 2 3 4)) 
(list-length "(a b . c)) 



4 
#f 



Rationale: A common use of call-with-current- 

continuation is for structured, non-local exits from loops 
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or procedure bodies, but in fact call-with-current- 
continuation is extremely useful for implementing a wide va- 
riety of advanced control structures. 

Whenever a Scheme expression is evaluated there is a contin- 
uation wanting the result of the expression. The continuation 
represents an entire (default) future for the computation. If 
the expression is evaluated at top level, for example, then the 
continuation will take the result, print it on the screen, prompt 
for the next input, evaluate it, and so on forever. Most of the 
time the continuation includes actions specified by user code, 
as in a continuation that will take the result, multiply it by 
the value stored in a local variable, add seven, and give the 
answer to the top level continuation to be printed. Normally 
these ubiquitous continuations are hidden behind the scenes and 
programmers don't think much about them. On rare occasions, 
however, a programmer may need to deal with continuations ex- 
plicitly. Call-with-current-continuation allows Scheme pro- 
grammers to do that by creating a procedure that acts just like 
the current continuation. 

Most programming languages incorporate one or more special- 
purpose escape constructs with names like exit, return, or 
even goto. In 1965, however, Peter Landin [21] invented a 
general purpose escape operator called the J-operator. John 
Reynolds [32] described a simpler but equally powerful con- 
struct in 1972. The catch special form described by Sussman 
and Steele in the 1975 report on Scheme is exactly the same as 
Reynolds's construct, though its name came from a less general 
construct in MacLisp. Several Scheme implementors noticed 
that the full power of the catch construct could be provided by 
a procedure instead of by a special syntactic construct, and the 
name call-with-current-continuation was coined in 1982. 
This name is descriptive, but opinions differ on the merits of 
such a long name, and some people use the name call/cc in- 
stead. 

6.10. Input and output 
6.10.1. Ports 

Ports represent input and output devices. To Scheme, an 
input device is a Scheme object that can deliver characters 
upon command, while an output device is a Scheme object 
that can accept characters. 

(call-with-input-f ile string proc) 

essential procedure 
(call-with-output-f ile string proc) 

essential procedure 

Proc should be a procedure of one argument, and string 
should be a string naming a file. For call-with-input- 
f ile, the file must already exist; for call-with-output- 
f ile, the effect is unspecified if the file already exists. 
These procedures call proc with one argument: the port 
obtained by opening the named file for input or output. 
If the file cannot be opened, an error is signalled. If the 
procedure returns, then the port is closed automatically 



and the value yielded by the procedure is returned. If the 
procedure does not return, then Scheme will not close the 
port unless it can prove that the port will never again be 
used for a read or write operation. 

Rationale: Because Scheme's escape procedures have unlim- 
ited extent, it is possible to escape from the current contin- 
uation but later to escape back in. If implementations were 
permitted to close the port on any escape from the current con- 
tinuation, then it would be impossible to write portable code 
using both call-with-current-continuation and call-with- 
input-f ile or call-with-output-f ile. 



(input-port? obj) 
(output-port? obj) 



essential procedure 
essential procedure 



Returns #t if obj is an input port or output port respec- 
tively, otherwise returns #f . 



(current - input -port) 
(current - output -port) 



essential procedure 
essential procedure 



Returns the current default input or output port. 

(with-input-from-f ile string thunk) procedure 

(with-output-to-file string thunk) procedure 

Thunk must be a procedure of no arguments, and string 
must be a string naming a file. For with- input -from- 
f ile, the file must already exist; for with-output-to- 
file, the effect is unspecified if the file already exists. 
The file is opened for input or output, an input or output 
port connected to it is made the default value returned 
by current -input -port or current-output-port, and 
the thunk is called with no arguments. When the thunk 
returns, the port is closed and the previous default is 
restored. With-input-from-f ile and with-output-to- 
file return the value yielded by thunk. If an escape pro- 
cedure is used to escape from the continuation of these 
procedures, their behavior is implementation dependent. 



(open-input-file Rename) 



procedure 



Takes a string naming an existing file and returns an input 
port capable of delivering characters from the file. If the 
file cannot be opened, an error is signalled. 



(open-output-file filename) procedure 

Takes a string naming an output file to be created and 
returns an output port capable of writing characters to a 
new file by that name. If the file cannot be opened, an 
error is signalled. If a file with the given name already 
exists, the effect is unspecified. 
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(close-input-port port) 
(close-output-port port) 



procedure 
procedure 



Closes the file associated with port, rendering the port in- 
capable of delivering or accepting characters. These rou- 
tines have no effect if the file has already been closed. The 
value returned is unspecified. 



essential procedure 
essential procedure 



6.10.2. Input 

(read) 
(read port) 

Read converts written representations of Scheme objects 
into the objects themselves. That is, it is a parser for the 
nonterminal (datum) (see section 7.1.2). Read returns the 
next object parsable from the given input port, updating 
port to point to the first character past the end of the 
written representation of the object. 

If an end of file is encountered in the input before any char- 
acters are found that can begin an object, then an end of 
file object is returned. The port remains open, and fur- 
ther attempts to read will also return an end of file object. 
If an end of file is encountered after the beginning of an 
object's written representation, but the written represen- 
tation is incomplete and therefore not parsable, an error is 
signalled. 

The port argument may be omitted, in which case it de- 
faults to the value returned by current -input -port. It is 
an error to read from a closed port. 

(read-char) essential procedure 

(read-char port) essential procedure 

Returns the next character available from the input port, 
updating the port to point to the following character. If 
no more characters are available, an end of file object is 
returned. Port may be omitted, in which case it defaults 
to the value returned by current -input -port. 

(char-ready?) procedure 

(char-ready? port) procedure 

Returns #t if a character is ready on the input port and 
returns #f otherwise. If char-ready returns #t then the 
next read-char operation on the given port is guaranteed 
not to hang. If the port is at end of file then char-ready? 
returns #t. Port may be omitted, in which case it defaults 
to the value returned by current -input -port. 

Rationale: Char-ready? exists to make it possible for a pro- 
gram to accept characters from interactive ports without getting 
stuck waiting for input. Any input editors associated with such 
ports must ensure that characters whose existence has been as- 
serted by char-ready? cannot be rubbed out. If char-ready? 
were to return #f at end of file, a port at end of file would 



be indistinguishable from an interactive port that has no ready 
characters. 

(eof -object? obj) essential procedure 

Returns #t if obj is an end of file object, otherwise returns 
#f . The precise set of end of file objects will vary among 
implementations, but in any case no end of file object will 
ever be a character or an object that can be read in using 
read. 

6.10.S. Output 

(write obj) essential procedure 

(write obj port) essential procedure 

Writes a representation of obj to the given port. Strings 
that appear in the written representation are enclosed in 
doublequotes, and within those strings backslash and dou- 
blequote characters are escaped by backslashes. Write re- 
turns an unspecified value. The port argument may be 
omitted, in which case it defaults to the value returned by 
current -output -port . 

(display obj) essential procedure 

(display obj port) essential procedure 

Writes a representation of obj to the given port. Strings 
that appear in the written representation are not enclosed 
in doublequotes, and no characters are escaped within 
those strings. In those implementations that have a dis- 
tinct character type, character objects appear in the repre- 
sentation as if written by write-char instead of by write. 
Display returns an unspecified value. The port argument 
may be omitted, in which case it defaults to the value re- 
turned by current-output-port. 

Rationale: Write is intended for producing machine-readable 
output and display is for producing human-readable output. 
Implementations that allow "slashification* within symbols will 
probably want write but not display to slashify funny charac- 
ters in symbols. 

(newline) essential procedure 

(newline port) essential procedure 

Writes an end of line to port. Exactly how this is done 
differs from one operating system to another. Returns an 
unspecified value. The port argument may be omitted, in 
which case it defaults to the value returned by current - 
output -port. 

(write-char char) essential procedure 

(write-char char port) essential procedure 

Writes the character char (not a written representation of 
the character) to the given port and returns an unspecified 
value. The port argument may be omitted, in which case it 
defaults to the value returned by current-output-port. 
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6.10.4. User interface 

Questions of user interface generally fall outside of the do- 
main of this report. However, the following operations are 
important enough to deserve description here. 



7. Formal syntax and semantics 

This chapter provides formal descriptions of what has al- 
ready been described informally in previous chapters of this 
report. 



(load filename) essential procedure 

Filename should be a string naming an existing file con- 
taining Scheme source code. The load procedure reads ex- 
pressions and definitions from the file and evaluates them 
sequentially. It is unspecified whether the results of the 
expressions are printed. The load procedure does not 
affect the values returned by current -input -port and 
current-output-port. Load returns an unspecified value. 

Note: For portability, load must operate on source files. Its 
operation on other kinds of files necessarily varies among im- 
plementations. 



(transcript-on filename) 
(transcript-off) 



procedure 
procedure 



Filename must be a string naming an output file to be cre- 
ated. The effect of transcript-on is to open the named 
file for output, and to cause a transcript of subsequent 
interaction between the user and the Scheme system to 
be written to the file. The transcript is ended by a call 
to transcript-off, which closes the transcript file. Only 
one transcript may be in progress at any time, though some 
implementations may relax this restriction. The values re- 
turned by these procedures are unspecified. 




7.1. Formal syntax 

This section provides a formal syntax for Scheme written 
in an extended BNF. The syntax for the entire language, 
including features which are not essential, is given here. 

All spaces in the grammar are for legibility. Case is insignif- 
icant; for example, #xlA and #Xla are equivalent, (empty) 
stands for the empty string. 

The following extensions to BNF are used to make the de- 
scription more concise: (thing)* means zero or more occur- 
rences of (thing); and (thing)" 1 " means at least one (thing). 

7.1.1. Lexical structure 

This section describes how individual tokens (identifiers, 
numbers, etc.) are formed from sequences of characters. 
The following sections describe how expressions and pro- 
grams are formed from sequences of tokens. 

(Intertoken space) may occur on either side of any token, 
but not within a token. 

Tokens which require implicit termination (identifiers, 
numbers, characters, and dot) may be terminated by any 
(delimiter), but not necessarily by anything else. 

(token) — » (identifier) | (boolean) | (number) 
| (character) | (string) 

i ( i > i #( i • i * i . i .• i . 

(delimiter) — ► (whitespace) | ( | ) | " | ; 

(whitespace) — ► (space or newline) 

(comment) — ► ; (all subsequent characters up to a 

line break) 
(atmosphere) — ► (whitespace) | (comment) 
(intertoken space) — * (atmosphere)* 

(identifier) — ► (initial) (subsequent)* 

| (peculiar identifier) 
(initial) — ► (letter) | (special initial) 



< I" 



(letter) — > a |b | c | ... | z 




(special initial) — ► ! | $ | % | 4 | 

1 > 1 ? 1 " 1 - 1 ~ 
(subsequent) — ► (initial) | (digit) 


* 1 / 1 = 




| (special subsequent) 




(digit) — +0|1|2|3|4|S| 


6 | 7 | 8 | 


(special subsequent) — ► . | + | - 




(peculiar identifier) — ► + | - 




(syntactic keyword) — ► (expression 


keyword) 


| else | => | define 




| unquote | unquote-splicing 


(expression keyword) — ► quote | 1 


Lambda ] if 
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| set! | begin | cond | and | or | case 
I let | let* | letrec | do | delay 
| quasiquote 

(variable) — ► (any (identifier) which isn't 
also a (syntactic keyword)) 

(boolean) — ► #t | #f 
(character) — ► #\ (any character) 

| #\ (character name) 
(character name) — ► space | newline 

(string) — ► " (string element)* " 

(string element) — ► (any character other than M or \) 

I \" I w 

(number) — ► (real) | (real) + (ureal) i 
| (real) - (ureal) i | (real) « (real) 
(real) — ► (sign) (ureal) 
(ureal) — ► (ureal 2) | (ureal 8) | (ureal 10) | (ureal 16) 

The following rules for (ureal R), (uinteger R), and (prefix 
R) should be replicated for R = 2, 8, 10, and 16: 

(ureal R) — ► (prefix R) (uinteger R) (suffix) 

| (prefix R) (uinteger R) / (uinteger R) (suffix) 
(digit R) + #* (suffix) 

. (digit R)* #* (suffix) 
#* . #* (suffix) 
(prefix R) — > (radix R) (exactness) (precision) 
| (radix R) (precision) (exactness) 
(exactness) (radix R) (precision) 
(precision) (radix R) 
(radix R) (exactness) 
(exactness) (radix R) 
(digit R) + #* 



(prefix R) 

(prefix R) (digit jR) + 

(prefix R) (digit R)+ 



| (exactness) 

| (precision) 

| (precision) 

(uinteger R) — ► 



(sign) — ► (empty) | + | - 

(suffix) — ► (empty) | e (sign) (digit) + 

(exactness) — ► (empty) | #i | #e 

(precision) - — ► (empty) | #s | #1 

(radix 2) — ► #b 

(radix 8) — ► #o 

(radix 10) — > (empty) | #d 

(radix 16) — ► #x 

(digit 2) — ►Oil 

(digit 8) — >0|1|2|3|4|5|6|7 

(digit 10) — ► (digit) 

(digit 16) — ► (digit) |a|b|c|d|e|f 

7.1.2. External representations 

(Datum) is what the read procedure (section 6.10.2) suc- 
cessfully parses. Note that any string which parses as an 
(expression) will also parse as a (datum). 

(datum) — ^ (simple datum) | (compound datum) 



(simple datum) — ► (boolean) | (number) 

| (character) | (string) | (symbol) 
(symbol) — ► (identifier) 
(compound datum) — ► (list) | (vector) 
(list) — ► ((datum)*) | ((datum)* . (datum)) 

| (abbreviation) 
(abbreviation) — ► (abbrev prefix) (datum) 
(abbrev prefix) — ► * | * | , | , 
(vector) — ► #( (datum)*) 

7.1.3. Expressions 

(expression) — ► (variable) 
(literal) 

(procedure call) 
(lambda expression) 
(conditional) 
(assignment) 
(derived expression) 

(literal) — ► (quotation) | (self-evaluating) 
(self-evaluating) — ► (boolean) | (number) 

| (character) | (string) 
(quotation) — ► "(datum) | (quote (datum)) 
(procedure call) — ► ((operator) (operand)*) 
(operator) — * (expression) 
(operand) — ► (expression) 

(lambda expression) — ► (lambda (formals) (body)) 
(formats) — > ((variable)*) | (variable) 

| ( (variable) + . (variable)) 
(body) — ► (definition)* (sequence) 
(sequence) — ► (command)* (expression) 
(command) — * (expression) 

(conditional) — ► (if (test) (consequent) (alternate)) 
(test) — ► (expression) 
(consequent) — ► (expression) 
(alternate) — ► (expression) | (empty) 

(assignment) — ► (set! (variable) (expression)) 

(derived expression) — ► 

(cond (cond clause) + ) 
| (cond (cond clause)* (else (sequence))) 
| (case (expression) 

(case clause)"*") 
| (case (expression) 

(case clause)* 

(else (sequence))) 
| (and (test)*) 
| (or (test)*) 

| (let ((binding spec)*) (body)) 
| (let (variable) ((binding spec)*) (body)) 
j (let* ((binding spec)*) (body)) 
| (letrec ((binding spec)*) (body)) 
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j (begin (sequence)) 
| (do ((iteration spec)*) 
((test) (sequence)) 
(command)*) 
| (delay (expression)) 
| (quasiquotation) 

(cond clause) — ► ((test) (sequence)) 

| (else (sequence)) 

I ((test)) 

| ((test) »> (recipient)) 
(recipient) — ► (expression) 
(case clause) — ► (((datum)*) (sequence)) 

| (else (sequence)) 

(binding spec) — ► ((variable) (expression)) 
(iteration spec) — ► ((variable) (init) (step)) 

| ((variable) (init)) 
(init) — ► (expression) 
(step) — ► (expression) 

7.1.4. Quasiquotations 

The following grammar for quasiquote expressions is not 
context-free. It is presented as a recipe for generating an 
infinite number of production rules. Imagine a copy of the 
following rules for D = 1, 2, 3, . . .. D keeps track of the 
nesting depth. 

(quasiquotation) — ► (quasiquotation 1) 
(template 0) — ► (expression) 
(quasiquotation D) — ► " (template D) 

| (quasiquote (template D)) 
(template D) — ► (simple datum) 

| (list template D) 

| (vector template D) 

| (unquotation D) 
(list template D) — ► ((template or splice £>)*) 

| ((template or splice D) + . (template D)) 

j ' (quasiquotation D) 

| (quasiquotation D + 1) 
(vector template D) — ► # ((template or splice D)*) 
(unquotation D) — ► , (template D — 1) 

| (unquote (template D — 1) ) 
(template or splice D) — ► (template D) 

| (splicing unquotation D) 
(splicing unquotation D) — ► ,fi (template D — 1) 

| (unquote -splicing (template D — 1)) 

In (quasiquotation)s, a (list template D) can sometimes be 
confused with an (unquotation D) or (splicing unquotation 
D). The interpretation as an (unquotation) or (splicing 
unquotation D) takes precedence. 

7.1.5. Programs and definitions 

(program) — ► (command or definition)* 



(command or definition) — ► (command) | (definition) 
(definition) — ► (define (variable) (expression)) 

| (define ((variable) (def formals)) (body)) 
(def formals) — ► (variable)* 

| (variable)* . (variable) 

7.2. Formal semantics 

This section provides a formal denotational semantics for 
the primitive expressions of Scheme and selected built-in 
procedures. The concepts and notation used here are de- 
scribed in [50]; the notation is summarized below: 

(...) sequence formation 

s I k kth member of the sequence a (1-based) 

#3 length of sequence a 

a\t concatenation of sequences s and t 

s t k drop the first fc members of sequence s 

t —* a,b McCarthy conditional "if t then a else b" 

p[x/i] substitution "p with x for t" 

x in D injection of x into domain D 

x | D projection of x to domain D 

To avoid special treatment for a top-level environment, the 
semantics assumes that environments assign locations to all 
variables: 

The reason that expression continuations take sequences 
of values instead of single values is to simplify the formal 
treatment of procedure calls and to make it easy to add 
multiple return values. 

The order of evaluation within a call is unspecified. We 
mimic that here by applying arbitrary permutations per- 
mute and unpermute, which must be inverses, to the ar- 
guments in a call before and after they are evaluated. 
This still requires that the order of evaluation be con- 
stant throughout a program (for any given number of ar- 
guments), but it is a closer approximation to the intended 
semantics than a left-to-right evaluation would be. 

The storage allocator new is implementation-dependent, 
but it must obey the following axiom: if newae L, then 
a {new a | L) J. 2 = false. 

The semantics in this section was translated by machine 
from an executable version of the semantics written in 
Scheme itself. 

7.2.1. Abstract syntax 

K 6 Con constants, including quotations 

I e Ide identifiers (variables) 

E e Exp expressions 

T 6 Com = Exp commands 

Exp — ► K | I | (Eo E*) 

| (lambda (I*) T* E ) 
I (lambda (I* . I) T* Eq) 



7. Formal syntax and semantics 33 



| (lambda I T* E ) 
| (if Eo Ei E2) | (if Eo Ei) 
(set! I E) 



7.2.2. Domain equations 



a 6 L 
1/6 N 

T = {false, true} 

Q 
H 
R 
E p = L X L 



locations 
natural numbers 
booleans 
symbols 
characters 
numbers 
pairs 
Ey = L* vectors 

E s = L* strings 

M = {false, true, null, undefined, unspecified} 

miscellaneous 
d> 6 F = L X (E* — ► K — » C) procedure values 
«€E = Q + H + R + E p +Ev+E s + M + F 

expressed values 
a e S = L — » (E X T) stores 

p e U = Ide —► L environments 

e C = S — ♦ A command continuations 

k 6 K = E* — » C expression continuations 

A answers 

X errors 

7.2.3. Semantic functions 

K : Con -+ E 
£ : Exp -f U -► K -► C 
f*:Exp*-+U-»K — C 
C : Com* -tU-+C-*C 

Definition of K deliberately omitted. 

£{Kl = XpK. send (KIKJ)k 

£ m = XpK . hold (lookup p I) 

(singlc(\e . e = undefined — ► 

wrong "undefined variable", 
aenderc)) 

£((Eo E*)l = 
Apw . <?*(permu*e( (Eo) § E*)) 

P 

(Ae* . ((Ae* . applieate (e* 1 1) (e* f 1) «) 
(unpermufe e*))) 

£{(lambda (I*) T* Eo)J = 

XpK . Xff . 

new <r eL — » 
«end({new<r|L, 

Ae*«' . #e* = #1* -» 

tfevafa(Aa* . (A/ . C|[r*Jp'(£ [Eb] A')) 
(esrfendapl* «*)) 

€*, 

wronj "wrong number of arguments") 
inE) 



(update (newer I L) unspecified c), 
wrorij "out of memory" <r 

£f (lambda (I* . I) T* Eo)J = 
Ap« . A<7 . 
newc eL— ♦ 
«end ({newer jL, 

AeV . #e* > #1* -» 
tievalsrest 
(Aa*.(A/)'.Clr*lp'(£IEolp'«')) 

(erfend»p(I*§(I'))a*)) 
e* 

(#1*). 
wrong "too few arguments") in E) 

K 

(update (newer J L) unspecified cr), 
wrong "out of memory" <r 

c?[(lambda I T* Eo)l = £[(lambda (. I) T* Eo)J 

£[(if Eo Ei E a )l = 
XpK . fjEol p (single (Ae . truwA e -+ £ jEiJpK, 

flEaJpic)) 

£ |(i* Eo EJJ = 
Ap« . £|Eo| p (single (Ae . fruuA e ~» £ |EiJp«, 

send unspecified «)) 

Here and elsewhere, any expressed value other than undefined 
may be used in place of unspecified 

c?|(set! I E)J = 
ApK . £|E] p (single(Xe . assign (lookup p I) 

e 
(send unspecified k))) 

£*{\=Xpk.k() 

f*[EoE*l = 
Ap« .£lEo]p (single(Xe . £*fE*| p (Ae* . k ((e ) § e*)))) 

C\] = X P 9.e 

C{T T*J = Apt? . £[T \ p (Ae* . C[r*]p6) 

7.2.4. Auxiliary functions 

lookup : U — ♦ Ide — ♦ L 
lookup = Apl . pi 

eirfende : U -► Ide* -♦ L* -» U 
extends = 
Apl*a* . #1* = -> p, 

erfend, (p[(e** i l)/(I* 1 1)]) (I* f 1) (<** 1 1) 

wronj : X — * C [implementation-dependent] 

send : E — » K — » C 
send= Xck . n(e) 

single : (E -» C) -» K 
single = 
AV-e*.#e* = l-»^(e*il), 

wrong "wrong number of return values" 
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new : S — ► (L + {error}) [implementation-dependent] 

hold :L-»K-»C 

hold = Xcxko- . »tnd{<xa I 1)k<t 

assign : L — ► E — » C -+ C 
assign = XcteBo- . 0(update cteo) 

update : L -+ E — > S -» S 
update = Xatc . <r[(e, true) /a] 

tievals : (L* -♦ C) -♦ E* -♦ C 
tievals = 
\il>e*<r . #«* = -» tf( )<r, 

netu<reL-» tfewoto (Aa* . ^((newc |L) § a*)) 

(«*tl) 

(upda<e(netoo- 1 L)(e* I l)<r), 
toronj "out of memory* c 

tievalsrest : (L* -♦ C) -> E* -> M -+ C 
tievalsrest = 
Ai/>e*i/ . fof (dropfirst e*v) 

(single(Xe , tievals x/> ((takefirst e*u) § {«)))) 

dropfirst = Xln . n = -+ i, dropfirst (I t l)(n - 1) 

iaifee/ira< = Ain . n = -» {). (U 1} § (takefirst {I t l)(n - 1)) 

iruwft : E — ► T 

/rut's/* = A« . (e = false V 6 = n«//) -+ false, true 

permute : Exp* — » Exp* [implementation-dependent] 

unpermute : E* — » E* [inverse of permute] 

applicate : E -► E* -♦ K -+ C 
applieate = 
\ee*K . e € F -» (e | F J, 2)e*«, wronj "bad procedure* 

onearg : (E -♦ K -+ C) -♦ (E* -> K -+ C) 
onearg = 
A?£*K.#£* = l-»?(e*;iK 

toronj "wrong number of arguments" 

twoarg :(E-»E->K-»C)->(E*-+K-»C) 
twoarg = 
A ? e*K.#€* = 2-+ f ( e *il)(e*i2)K, 

toronj "wrong number of arguments" 

list : E* -> K -> C 
«rf = 
Ae*/c . #e* = -+ send nullK, 

list(e* f l)(single(\e . cons(e* I l,e}«)) 

cons : E* -* K -» C 
con* = 
twoarg (XeiizKff . new a e L — ► 

(Aff' . new c' € L -* 

send ((new <r | L, new a' | L) in E) 

K 

(update(newor' \ L)ett<r'), 
wrong "out of memory* a') 
(update(newo~ \ L)«i<r), 
wrong "out of memory" <r) 



less : E* -♦ K -♦ C 

less = 

twoarg (Aeie 3 »e . (ei e R A e 3 € R) — ► 

send(ei | R < e 3 | R -» true,false)K, 
wrong "non-numeric argument to <*) 

add : E* -► K -» C 

twoarg (Aeie 3 »c . (ex e R A 62 6 R) -♦ 

tend ((61 | R+ e 3 | R) in E)/c, 

wrong "non-numeric argument to +*) 

ear : E* -+ K -+• C 
ear = 

onearg (Xen .«eEp-» hold(e | Ep i 1)k, 

wrong "non-pair argument to car") 

cdr : E* — ► K — ► C [similar to car] 

aefear : E* -* K -» C 
«e<car = 

twoarg (Aeie 3 K . ei e Ep -+ aaw'jn (ei | Ep 1 1) 

ea 

(tend unspecified k), 
wrong "non-pair argument to set-car!") 

eqv : E* -♦ K -* C 
eqv = 
twoarg (Aeie 3 K . (ei e M A e 3 € M) — ► 

send(ei | M = e 3 | M — ► true,faise)n, 
(ei 6 Q A e 2 e Q) -+ 

«end(ei | Q = e 3 | Q -» true,false)K, 
(ei6HAe 3 6H)-* 

«end(ei | H = ea | H -» true,false)K, 
(ei eRAcjeR) -» 

«end(e x | R = ea | R -» true,false)K, 
(ei e Ep A 63 e E p ) -+ 

«nd((A Pl pa . ((pi i 1) = (pa i l)A 

(Pi i 2) = (pa i 2)) -+ irue, 
/o/ae) 
(ei|E P ) 
(e 2 |Ep)) 

K, 

(ei G Ev A e 2 € E v ) -»• . . . , 
(ei 6 E, A e 2 e E.) -» . . . , 
(«i€FAfj6F)-» 

«nd((ei I F i 1) = (ea I F J. 1) -+ truejalse) 

«end /a/ae k) 

apply : E* -» K -+ C 
apply = 
iwoorj (AeuaK . e t e F -» t/o/«eah'at {e 2 )(Ae* . opp/icoie eie*«), 
toronj "bad procedure argument to apply") 

vaiuetUst : E* -» K -> C 
t/a/ue«K«t = 

oneorj (Ae/c . e e E p -» 
erfr(e) 

(Ae* . valueslist 
e* 

(Ae* . car(e)(amff/e(Ae . «((e) § e*))))), 
e = nu//-» k{), 

turonj "non-list argument to valuea-list") 
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ewce : E* -» K — ► C [eall-wlth-current-continuation] 
cwcc = 

on«orj(A«/c.e6F — » 

(Aff . nevxr eL — » 
applieate e 

({newff | L,A«*k' .«£*) in E) 
/c 

(updafe (newer | L) 
unspecified 

'), 
wrong "out of memory" <r), 
wrong "bad procedure argument*) 

7.3. Derived expression types 

This section gives rewrite rules for the derived expression 
types. By the application of these rules, any expression 
can be reduced to a semantically equivalent expression in 
which only the primitive expression types (literal, variable, 
call, lambda, if, set!) occur. 

(cond ((test) (sequence)) 
(clauses) •••) 
= (if (test) 

(begin (sequence)) 
(cond (clauses) •••)) 



(cond ((test)) 

(clauses) • • • ) 
= (or (test) (cond (clauses) 



.)) 



(cond ((test) -> (recipient)) 
(clauses) ...) 
= (let ((test-result (test)) 

(thunk2 (lambda () (recipient))) 
(thunkS (lambda () (cond (clauses) 
(if test-result 

((thunk2) teat -result) 
(thunkS))) 

(cond (else (sequence))) 
= (begin (sequence)) 



(cond) 

= (some expression returning an unspecified value) 

(case (key) 

((dl ...) (sequence)) 

...) 

a (let ((key (key)) 

(thunkl (lambda () (sequence))) 
...) 
(cond (((memv) key '(dl ...)) (thunkl)) 

...)) 

(case (key) 

((dl ...) (sequence)) 

(else fl tl ...)) 
= (let ((key (key)) 



.)))) 



(thunkl (lambda () (sequence))) 

(elsethunk (lambda ()...))) 
(cond (((memv) key '(dl ...)) (thunkl)) 

(else (elsethunk)))) 

where (memv) is an expression evaluating to the memv pro- 
cedure. 

(and) = #t 

(and (test)) = (test) 

(and (test,) (tests) ...) 
= (let ((x (test,)) 

(thunk (lambda () (and (tests) •••)))) 
(if x (thunk) x)) 



(or) = #f 

(or (test)) = (test) 

(or (test,) (testa) ...) 
s (let ((x (testi)) 

(thunk (lambda 
(if x x (thunk))) 



() (or (test,) ...)))) 



(let (((variable,) (init,)) ...) 
(body)) 
= ((lambda ((variable,) ...) (body)) (init,) ...) 

(let* () (body)) 

s ((lambda () (body))) 
(let* (((variable,) (init,)) 
((variables) (inita)) 
...) 
(body)) 
S (let (((variable,) (init,))) 

(let* (((variables) (init,)) 
...) 
(body))) 

(letrec (((variable,) (init,)) 
...) 
(body)) 

s (let (((variable,) (undefined)) 
...) 
(let (((temp,) (init,)) 
...) 
(sat! (variable,) (temp,)) 
...) 
(body)) 

where (tempi), (tempa), ... are variables, distinct from 
(variablei), ..., which do not free occur in the original 
(init) expressions, and (undefined) is an expression which 
returns something which when stored in a location makes 
it an error to try to obtain the value stored in the location. 
(No such expression is defined, but one is assumed to ex- 
ist for the purposes of this rewrite rule.) The second let 
expression in the expansion is not strictly necessary, but it 
serves to preserve the property that the (init) expressions 
are evaluated in an arbitrary order. 



36 Revised 3 Scheme 



(begin (sequence)) 

= ((lambda () (sequence))) 

The following alternative expansion for begin does not 
make use of the ability to write more than one expres- 
sion in the body of a lambda expression. In any case, note 
that these rules apply only if (sequence) contains no defi- 
nitions. 

(begin (expression)) = (expression) 
(begin (command) (sequence)) 

= ((lambda (ignore thunk) (thunk)) 

(command) 

(lambda () (begin (sequence)))) 

(do (((variablei) (initi) (stepi)) 
...) 

((test) (sequence)) 
(command i) ...) 
s (letrec (((loop) 

(lambda ((variablei) ...) 
(if (test) 

(begin (sequence)) 
(begin (command i) 



((loop) (initx) ...)) 



((loop) (ste Pl ) ...)))))) 



where (loop) is any variable which is distinct from 
(variablei), ..., and which does not occur free in the do 
expression. 

(let (variableo) (((variablei) (initi)) •••) 
(body)) 

= ((letrec (((variableo) (lambda ((variablei) •••) 

(body)))) 
(variableo)) 
(initx) ...) 

(delay (expression)) 

= ((make-promise) (lambda () (expression))) 

where (make-promise) is an expression evaluating to some 
procedure which behaves appropriately with respect to the 
force procedure; see section 6.9. 



NOTES 

Language changes 

This section enumerates the changes that have been made 
to Scheme since the "Revised revised report" [4] was pub- 
lished. 

• The character * (circumflex) is now an extended al- 
phabetic character 

• The objects returned by literal expressions are permit- 
ted to be immutable 

• The list to which a rest-argument becomes bound must 
be newly allocated 

• Do variables are updated by rebinding rather than by 
assignment 

• New expression type: delay 

• Quasiquote (backquote) has been improved in several 
ways: vectors are allowed; nesting is allowed; and an 
external syntax for quasiquote expressions (analogous 
to that for quote) has been defined 

• The semantics of definitions of the form (define 
((variable) (formals)) (body)) no longer involves an 
implicit rec or letrec 

• The "curried" definition syntax has been removed 

• The boolean constants are now written #t and #f in- 
stead of #! true and #!false 

• The syntax #!null (for the empty list) has been re- 
moved 

• New procedures: boolean?, procedure?, and force 

• The value of eq? on numbers and characters is now 
unspecified 

• Eq? and eqv? now explicitly permit operationally 
equivalent procedures to be identified 

• Eqv? distinguishes exact numbers from inexact ones, 
even if they are equal according to ■ 

• List, string, and vector indexes must be exact integers 

• At an now admits either one or two arguments 

• Expression types removed: named- lambda, rec, 
sequence 

• Procedures removed: append!, string- 
null?, substring-fill!, substring-move-left!, 
substring-move-right!, object -hash, object- 
unhash, 1+, -1+ 

• Redundant procedure names removed: <?, <«*?, ■?, 
>?, and >-? 
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Keywords as variable names 

Some implementations allow arbitrary syntactic keywords 
to be used as variable names, instead of reserving them, 
as this report would have it. But this creates ambiguities 
in the interpretation of expressions: for example, in the 
following, it's not clear whether the expression (if 12 3) 
should be treated as a procedure call or as a conditional. 



(define if list) 
(if 12 3) 



2 or (1 2 3) 



These ambiguities are usually resolved in some consistent 
way within any given implementation, but no particular 
treatment stands out as being clearly superior to any other, 
so these situations were excluded for the purposes of this 
report. 

Macros 

Scheme does not have any standard facility for defining 
new kinds of expressions. 

The ability to alter the syntax of the language creates nu- 
merous problems. All current implementations of Scheme 
have macro facilities that solve those problems to one de- 
gree or another, but the solutions are quite different and 
it isn't clear at this time which solution is best, or indeed 
whether any of the solutions are truly adequate. Rather 
than standardize, we are encouraging implementations to 
continue to experiment with different solutions. 

The main problems with traditional macros are: They 
must be defined to the system before any code using them 
is loaded; this is a common source of obscure bugs. They 
are usually global; macros can be made to follow lexical 
scope rules , but many people find the resulting scope rules 
confusing. Unless they are written very carefully, macros 
are vulnerable to inadvertant capture of free variables; to 
get around this, for example, macros may have to generate 
code in which procedure values appear as quoted constants. 
There is a similar problem with syntactic keywords if the 
keywords of special forms are not reserved. If keywords are 
reserved, then either macros introduce new reserved words, 
invalidating old code, or else special forms defined by the 
programmer do not have the same status as special forms 
defined by the system. 



EXAMPLE 

Integrate-system integrates the system 

yfc = /fc(yi»y2 y„), *= i,...,n 

of differential equations with the method of Runge-Kutta. 

The parameter system-derivative is a function that 
takes a system state (a vector of values for the state vari- 
ables yi, . . . , y n ) and produces a system derivative (the val- 
ues s4,...,$4)' The parameter initial-state provides 
an initial system state, and h is an initial guess for the 
length of the integration step. 

The value returned by integrate-system is an infinite 
stream of system states. 

(define integrate-system 

(lambda (system-derivative initial-state h) 

(let ((next (runge-kutta-4 system-derivative h))) 
(letrec ((states 

(cons initial-state 

(delay (map-streams next 

states))))) 
states)))) 

Runge-Kutta-4 takes a function, f , that produces a system 
derivative from a system state. Runge-Kutta-4 produces 
a function that takes a system state and produces a new 
system state. 

(define runge-kutta-4 
(lambda (f h) 

(let ((*h (scale-vector b.)) 
(*2 (scale-vector 2)) 
(*l/2 (scale-vector (/ 1 2))) 
(*l/6 (scale-vector (/ 1 6)))) 
(lambda (y) 

; ; y is a system state 
(let* ((kO (*h (f y))) 

(kl (*h (f (add-vectors y 
(k2 (*h (f (add-vectors y 
(k3 (*h (f (add-vectors y 
(add-vectors y 

(*l/6 (add-vectors kO 

(*2 kl) 
(*2 k2) 
k3)))))))) 

(define elementwise 
(lambda (f) 

(lambda vectors 
(generate-vector 

(vector-length (car vectors)) 
(lambda (i) 
(apply f 

(map (lambda (v) (vector-ref v i)) 
vectors))))))) 

(define generate-vector 
(lambda (size proc) 



(*l/2 kO))))) 
(*l/2 kl))))) 
k2))))) 
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(let ((ana (make-vector size))) 
(letrec ((loop 

(lambda (i) 
(cond ((« i size) aas) 
(else 
(vector-set! ana i (proc i)) 
(loop (♦ i 1))))))) 
(loop 0))))) 

(define add-vectora (elementwiae +)) 

(define scale-vector 
(lambda (8) 

(elementwiae (lambda (x) (* x a))))) 

Map-streams is analogous to map: it applies its first argu- 
ment (a procedure) to all the elements of its second argu- 
ment (a stream). 

(define map-streams 
(lambda (f a) 

(cons (f (head a)) 

(delay (map-streams f (tail a)))))) 

Infinite streams are implemented as pairs whose car holds 
the first element of the stream and whose cdr holds a 
promise to deliver the rest of the stream. 

(define head car) 
(define tail 

(lambda (stream) (force (cdr stream)))) 



The following illustrates the use of integrate-system in 
integrating the system 

,dvo . vo 



C 



dt 



-XL 



R 



T di L 

L -d7 = Vc 

which models a damped oscillator. 

(define damped-oscillator 
(lambda (R L C) 
(lambda (state) 

(let ((Vc (vector-ref 8tate 0)) 
(II (vector-ref atate i))) 
(vector (- (+ (/ Vc (* R C)) (/ II C))) 
(/ Vc L)))))) 

(define the- states 
(integrate- ayatem 

(damped-oscillator 10000 1000 .001) 

'#(1 0) 

.01)) 
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ALPHABETIC INDEX OF DEFINITIONS OF CONCEPTS, 
KEYWORDS, AND PROCEDURES 



The principal entry for each term, procedure, or keyword is 
listed first, separated from the other entries by a semicolon. 

! 5 

' 7; 15 

* 19; 7 

+ 19; 4, 7, 12, 27, 34 

, 11; 15 

- 20; 4 
/ 20 

I 5 

< 19; 34 

<- 19; 24 

- 19; 13, 36 
-> 8 

> 19 
>■= 19 
? 5 

abs 20; 12, 21 
acos 20 
and 9; 12 
angle 21 
append 16 
apply 26; 34 
asin 20 
assoc 16; 17 
assq 16; 17 
assv 16; 13, 17 
at-sign 11 
atan 20; 21, 36 

#b 19, 31 
backquote 11 
begin 10; 36 
binding 5 

binding construct 6 
boolean? 12; 36 
bound 6 

caar 15 

caddr 15 

cadr 15 

call 7 

call by need 10 

call-with-current-continuation 27; 28, 34 

call-with-input-file 28 

call-with-output-f ile 28 

call/cc 28 

car 15; 14, 34 

case 8 

catch 28 

cdddar 15 



cddddr 15 

cdr 15; 14, 27 

ceiling 20 

char->integer 24 

char-alphabetic? 23 

char-ci<«? 23 

char-ci<? 23 

char-ci-? 23 

char-ci>«? 23 

char-ci>? 23 

char-dovmcase 24 

char-lower-case? 24 

char-numeric? 23 

char-ready 29 

char-ready? 29 

char-upcase 24 

char-upper-case? 24 

char-whitespace? 23 

char<-? 23; 24 

char<? 23; 25 

char-? 23; 13 

char>-? 23 

char>? 23 

char? 23 

close-input-port 29 

close-output-port 29 

combination 7 

comma 11 

comment 5; 30 

complex? 19 

cond 8; 12 

cons 15; 14 

cos 20 

current -input -port 28; 29, 30 

current-output-port 28; 29, 30 

#d 19 

define 11; 12 
delay 10; 26, 27, 36 
denominator 20 
display 29 
do 10; 6, 12, 36 
dotted pair 14 

#e 19, 31 

else 8 

empty list 14; 12, 13, 15, 16, 36 

eof -object? 29 

eq? 14; 13, 16, 36 

equal? 14; 13, 16, 25 

equivalence predicate 13 

eqv? 13; 8, 14, 15, 16, 17, 36 
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error 3 

escape procedure 27 

essential 3 

even? 19 

exact -> inexact 21 

exact? 19 

exactness 18 

exactness 23 

exp 20 

expt 21 

#f 12 
false 6; 12 
fix 22 
flo 22 
floor 20 
f oo 4, 11 
for-each 26 
force 26; 10, 27, 36 

gcd 20 

gen-counter 13 
gen- loser 13 

heur 22 

#i 19,22,31 
identifier 4; 5, 17, 30 
if 8; 12, 33, 35 
imag-part 21 
improper list 15 
inexact->exact 21 
inexact? 19 
initial environment 12 
input-port? 28 
int 22 

integer->char 24 
integer? 19 
integrate-system 37, 38 
internal definition 12 

keyword 4, 5, 30, 37 

#1 19,31 

lambda 7; 12, 33, 35 
lambda expression 6 
last-pair 16 
lazy evaluation 10 
lcm 20 
length 16 
let 9, 10; 6, 12, 35 
let* 9; 6, 12 
letrec 9; 6, 10, 12 
list 16; 25 
list->string 25 
list->vector 26 



list-ref 16 
list-tail 16 
load 30 
log 20 

macros 37 
magnitude 21 
make-polar 21 
make-promise 27 
make-rectangular 21 
make- string 24 
make-vector 25 
map 26; 38 
map-streams 38 
max 19 
member 16; 17 
memq 16; 13, 17 
memv 16; 17, 35 
min 19 
modulo 20 
mutation procedure 13 

negative? 19 
newline 29 
nil 12 
not 12 
null? 16 
number 17 
number->string 21 
number? 19 
numerator 20 

#o 19,22,31 
odd? 19 

open-input-file 28 
open-output-file 28 
operationally equivalent 13 
or 9; 12 
output-port? 28 

pair 14 
pair? 15 
polar 22 
positive? 19 
predicate 13 
procedure call 7 
procedure? 26; 36 
promise 10; 26 

quasiquote 11; 15 
quote 7; 6, 15, 36 
quotient 20 

radix 23 
rat 22 
rational? 19 
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rationalize 20 

read 29; 3, 6, 15, 17, 31 

read-char 29 

real-part 21 

real? 19 

rect 22 

region 6; 8, 9, 10 

remainder 20 

return 28 

reverse 16 

round 20 

runge-kutta-4 37 

#s 19,31 

aci 22 

sequence 10 

set! 8; 11, 12, 33, 35 

set-car! 15; 7, 13, 14, 34 

set-cdr! 15; 14 

sin 20 

sqrt 21; 18, 19 

string->list 25 

string- >number 21 

string->symbol 17 

string-append 25 

string-ci<«? 25 

Btring-ci<? 25 

string-ci-? 24; 25 

string- ci>«»? 25 

string-ci>? 25 

string-copy 25 

string-fill! 25 

string- length 24 

string-ref 24 

string-set ! 24; 7, 17 

string<-? 25 

string<? 25 

string-? 24; 25 

string>«? 25 

string>? 25 

string? 24 

substring 25 

symbol->string 17 

symbol? 17 

syntactic keyword 4, 5, 30, 37 



unbound 6; 12 
unquote 11, 15 
unquote-splicing 11, 15 

valid indexes 24; 25 
values-list 34 
variable 5; 4, 6, 31, 37 
vector 25 
vector->list 26 
vector-fill! 26 
vector-length 25 
vector-ref 25 
vector-set ! 25 
vector? 25 

whitespace 5 

with-input-from-file 28 
with-output-to-file 28 
write 29; 6, 11, 17 
write-char 29 

#x 19, 31 

zero? 19 



#t 12 

t 12 

tan 20 

token 30 

top level environment 12; 6 

transcript-off 30 

transcript-on 30 

true 6; 8, 12 

truncate 20 



As the Scheme report was being prepared for publication, it was discovered 
that a concise language definition of only 43 pages was too thin to be bound 
properly. The following article has been appended in order to remedy this 
deficiency. 



Computation: An Introduction to Engineering Design 

Harold Abelson 
Gerald Jay Sussman 

Massachusetts Institute of Technology 

Computer science is not a science. Its significance has little to do with 
computers. The real significance of the computer revolution is that it is a 
revolution in the way we think, and in the way in which we express what 
we think. The essence of this change is the emergence of what might best be 
called procedural epistemology — the study of the structure of knowledge from 
an imperative point of view, as opposed to the more declarative point of view 
taken by classical mathematical subjects. A computer language, from this 
perspective, is a novel formal medium for expressing ideas about methodology, 
not just a way to get a computer to perform operations. Programs are written 
for people to read, and only incidentally for machines to execute. 

The MIT introductory computer-science subject reflects this view. In 
this subject, the syntax of particular programming-language constructs, clever 
algorithms for computing particular functions efficiently, or even the math- 
ematical analysis of algorithms and the foundations of computing are only 
peripheral issues. The essential material is the techniques used to control the 
complexity of large systems: 

1. We create abstractions to hide detail and to separate specification from 
implementation. 

2. We describe a design in terms of a sequence of levels of description. Ro- 
bust design requires that, at each level, the descriptive framework is rich 
enough so that small modifications of the design are only slightly different 
in description. 

3. We establish conventional interfaces that enable us to combine standard 
well-understood pieces in a mbc-and-match way. 

4. We make languages to describe designs in appropriate ways. Each lan- 
guage emphasizes particular aspects of designs and deemphasizes others. 

These techniques are not unique to the organization of computational 
systems. They are common to all of engineering design. By making these 
the centerpiece of our presentation, we come to view computer science as 
an abstract form of engineering. In a computational system, the elementary 
pieces are idealized and completely specified. Thus, the complexity of a design 
is limited only by the mind of the designer, not by the approximations inherent 
in modeling physical reality. 
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One way to control complexity is by using abstractions to hide details. A 
design language provides a set of primitive constructs, means by which these 
can be combined, and means of abstraction by which combinations may be 
named and manipulated as if they were primitive. In programming, we often 
express abstractions as procedures that capture general patterns of operations. 

A framework for thinking about design must not arbitrarily limit our abil- 
ity to make abstractions. In particular, an expressive programming language 
should draw no distinction between patterns that abstract over procedures 
and patterns that abstract over other kinds of data. Procedures must be al- 
lowed to be passed as arguments, returned as values, and incorporated into 
data structures. Prodedural data allows us to capture general methods so 
that we can name, analyze, and build on them. 

From a pedagogical point of view, expressing general methods as ex- 
ecutable procedures makes them concrete and often easier to understand. 
Mathematical constructions that involve functional operators are naturally 
expressed in this way. 

Here is a procedure that expresses the concept of summation — summing 
the values of a given function / evaluated at discrete points on the interval 
from a to 6: 

(define (sua t a next b) 
(if (> a b) 

(+ (f a) 

(sun f (next a) next b)))) 

Sum takes as its arguments the lower bound a, the upper bound b, the pro- 
cedure 1 that computes the value of the function /, and the procedure next 
that computes the next point of the interval to consider. The above definition 
expresses the idea that the sum over the interval from ato&is0ifa>6 and 
otherwise is equal to /(a) plus the sum over the interval from the next value 
after a until b. 

We can use the sum procedure as a building block in expressing further 
concepts. For instance, the definite integral of a function / between the limits 
a and b can be approximated numerically using the formula 



/*'-[' (* + f) +/(•+*+!) + '(- +M,+ f) 



dx 
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and we can express this formula procedurally in terms of sum: 

(define (integral fab dx) 
(define (add-dx x) (+ x dx)) 
(* (sum f (+ a (/ dx 2)) add-dx b) 
dx)) 

Now we can use integral as a building block in more general construc- 
tions. As an example, consider Picard's method for solving ordinary differen- 
tial equations of the form dx/dt = f(x,t), where x is a function of t with a 
given initial value x(0) = x . 

Picard's method for approximating the function x (the solution of the dif- 
ferential equation) uses the technique of iterative improvement. We choose an 
arbitrary function to serve as an initial approximation to x (for example, the 
function that has constant value x ) and repeatedly improve this approxima- 
tion by applying an operator that, given a function, produces a new function 
that better approximates x. This operator, when applied to a function y, 
produces the function whose value at t is 



f f(y(s),s)d a 



X + 

'0 

This function is a better approximation to the solution of the differential 
equation than y is. 

We can express the Picard operator as the following procedure: 

(define (((picard-operator f xO) y) t) 
(define (picard- integrand a) 

(f (y a) a)) 
(♦ xO (integral picard- integrand t dx))) 

The picard-operator procedure takes as its arguments a function / and the 
initial value x . It produces an operator, which when applied to a function y, 
produces a function of t. 

The nth-order Picard approximation to the solution x is obtained by 
beginning with any arbitrary function, and applying the operator n times. 

(define (picard-method f xO n) 
(define (arbitrary t) xO) 

((repeated-application n (picard-operator f xO)) 
arbitrary)) 

The repeated-application procedure takes a number and an operator 
as arguments. It produces the operator composed with itself the given number 
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of times. 

(define (repeated-application n operator) 
(il (- n 0) 

identity-operator 
(compose operator 

(repeated-application (- n 1) operator)))) 

(define ((compose f g) x) 
(f (g x))) 

To the student writing this program Picard's method is no longer a bunch 
of mathematical formulae, tied together with informal prose. It is actually 
executable, in a completely formal way. He can try it out on simple cases, 
examining the dynamic behavior and how the successive approximations relate 
to one another. This kind of activity leads to more complete understanding. 

Levels of language for robust design 

A complex design is a sequence of levels of description. Each level describes 
how parts of the design are constructed from elements that are regarded as 
primitive at that level. The parts constructed at each level are used as prim- 
itives at the next level. Thus, in designing electrical circuits, we use transis- 
tors, resistors and capacitors to build amplifiers, filters and oscillators. We 
assemble these to make various kinds of radios and sound equipment. 

If a design is to be robust, its parts must have more generality than is 
needed for the particular application. The means for combining the parts 
must allow variations in the design plan, such as substituting similar parts for 
one another and varying the arrangement in which parts are combined. This 
generality insures that, at any level, small design changes can be expressed 
by making small changes in the description. A powerful design framework 
must therefore be more than just a way to build hierarchies of abstractions. 
It must be a succession of languages each complete in itself. 

We see this succession of languages in electrical circuit design, where 
there are many conventional kinds of diagrams for describing how circuit 
elements are interconnected to produce modules, how modules are assembled 
into subsystems, and how subsystems are combined to make systems. Often, 
a particularly useful language is standardized as a commercial component 
family, such as TTL or CMOS to describe interconnection of digital-logic 
elements or Multibus to describe the assembly of board-level modules. 

The idea of stratified design was used by Peter Henderson [2] in a beauti- 
ful analysis of the construction of the "Square Limit" woodcut of M.C.Escher. 
Henderson created a system for easily describing such images. Henderson's 
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system consists of several languages. There is a language of primitive pictures 
that are constructed from points and lines. Built on top of this is a language 
of geometric combination that describes how pictures are placed relative to 
another to make compound pictures. At the next level, there is a language of 
general patterns of combination. 

In Henderson's system, a picture is a procedure that takes a rectangle 
as argument, and draws an image scaled to fit the rectangle. At the lowest 
level of description, a picture is described as a collection geometric elements, 
where each element is specified in terms of (x, y) coordinates with respect to 
the unit square (0 < x,y < 1). Figure 1 shows two simple pictures, diamond 
and leg, each described as line-segments that connect designated vertices. 
Besides segments, other picture elements appropriate to this level are circles 
with specified radii and centers, spline curves through designated points, and 
so on. 

The next level of description in Henderson's system is a language of geo- 
metric combinators — placing pictures beside or above one another, or rotating 
pictures through multiples of 90 degrees. For example, the beside combina- 
tor horizontally adjoins two pictures so that their widths are in a given ratio. 
Figure 2 shows two pictures adjoined by the beside combinator. 

One important feature of these geometric combinators is that pictures 
are closed under combination: The beside of two pictures is itself a picture 
and thus can be further combined with other pictures. Combinators can be 
abstracted: We can express common patterns of picture combination as new 
picture combinators defined in terms of other combinators. For example, a 
triangle combination is formed by placing one picture above two copies of 
another: 

(define (triangle pictl pict2 ratio) 
(above pictl 

(beside pict2 pict2 .6) 
ratio)) 
Moreover, since the combinators are expressed procedurally, we can easily 
construct complex combinators. The recursively defined combinator 

(define (right-push pict n ratio) 
(if (- n 0) 
pict 
(beside pict 

(right-push pict (- n 1) ratio) 
ratio))) 
produces the result of repeatedly adjoining n copies of a given picture, scaled 
by the given ratio as shown in figure 3. Similarly, using the primitive combi- 
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(deflae diaaoad 

(let ((vl (vertex 0.6 0)) 
(v2 (vertex 1 0.6)) 
(v3 (vertex 0.6 1)) 
(v4 (vertex 0.6))) 
(primitive-picture 
(segment vl v2) 
(ssgaeat v2 v3) 
(segaeat v3 v4) 
(segaeat v4 vl)))) 



(deflae leg 

(let ((vl (vertex (/ 1 8) 0)) 
(v2 (vertex (/ 14) 0)) 
(v3 (vertex 1 (/ 3 4))) 
(v4 (vertex 1 (/ 7 8))) 
(v6 (vertex (/ 1 4) (/ 1 3))) 
(v« (vertex (/ 1 8) (/ 1 2)))) 
(primitive-picture 

(segaeat vl v6) 

(■egaeut v6 v4) 

(■•gaeat v3 v6) 

(segaeat v6 v2)))) 



Figure 1. At the lowest level of description in Henderson's language, pic- 
tures are specified as collections of individual geometric elements. Here 
are two simple pictures, diamond and leg. 

nator above we can write 

(define (up-pnsh pict n ratio) 
(if (- n 0) 
pict 
(above pict 

(up-push pict (- n 1) ratio) 
ratio))) 
The combinator language derives its power from the closure and abstraction 
properties of the combinators. That is why it can describe seemingly complex 
figures using only a few simple ideas. 

The combinators themselves are manipulated at a third level of descrip- 
tion that captures common patterns of combining picture combinators. Thus, 
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(define legs (beside leg (rotateQO leg) .3)) 

Figure 2. The picture language include* geometric combinatory for ad- 
joining pictures horiiontally and vertically, and for rotating pictures by 
90 degrees. Here is a picture formed from elements in figure 1. 



right-push and up-push are instances of the general pattern of pushing, 
which we can implement as a repeated application: 

(define up-pnsh (push above)) 

(define right-push (push beside)) 

(define ((push combiner) pict n ratio) 
(define (basic -combination p) 

(combiner pict p ratio)) 
((repeated-application n basic -combination) pict)) 

Push is a higher-order combinator that transforms combinators to more elab- 
orate combinators. Having captured the idea of pushing, we can apply it to 
arbitrary combinators, for instance (see figure 4): 

(define pyramid (push triangle)) 

Given such a description of the picture in figure 4, we can easily vary the 
pieces at any level: We could change the location of a point in the primitive 
picture leg, we could replace legs by leg in the basic repeated unit, or we 
could even replace pyramid by some other picture plan based upon triangle. 
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(right-push leg 4 .4) 

Figure 3. The combinator right-push iterates the beside combinator. 
Here we see the result of adjoining leg to itself 4 times. 



Standard components and conventional interfaces 

In addition to controlling complexity by building abstractions, we control 
complexity by making designs modular. A modular design is a combination of 
relatively independent pieces. We can encourage modular design by providing 
a library of standard components together with a conventional interface for 
connecting the components in a variety of ways. In digital design, for example, 
all components in standard logic families such as TTL have the same power- 
supply requirements and the same signal definitions. 

Electrical engineers often design systems in terms of a language of block 
diagrams, consisting of blocks connected by lines. The blocks represent func- 
tions that process signals. The lines represent the communication paths by 
which these functional blocks are interconnected. The power in this language 
is in the assumption that all blocks obey the same interface specifications, so 
that they can be interconnected in a variety of configurations. 

The programming analogue of the block-diagram language is based upon 
data objects called streams. A stream is a (possibly infinite) sequence of 
values. For instance, we could represent keyboard input to a program as a 
stream of characters. In signal-processing applications, we could use streams 
of numbers to represent the values of signals sampled at discrete intervals. 
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(define animal (triangle diamond lags .3)) 




(pyramid animal 3 . 5) 

Figure 4. The triangle combinator placet one picture above two copies 
of another. We can use triangle to combine diaaond and legs. Iterating 
the triangle combinator by meant of the higer-order combinator puah 
produces the pyramid combinator. 
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For fun, we might have the stream of all positive integers. Functional blocks 
are procedures that take streams as inputs and produce streams as outputs. 
We might have a procedure that takes a stream of numbers and puts out a 
stream of the squares of the numbers in its input stream. The streams are 
the conventional interface through which such blocks are connected to form 
systems. 

Suppose we have a filter block that takes a stream as input and pro- 
duces the stream of only those values in the input stream that satisfy a given 
condition; a map block that applies a given operation to successive items in a 
stream, producing the stream of resulting values; and an add- streams block 
that produces the elementwise sum of two streams. Then, given the stream of 
positive integers we can filter this to form the stream of odd integers, square 
each result, and add these to the integers themselves: 

(add-streams (map square 

(filter odd? integers)) 
integers) 

Alternatively, by connecting the blocks in a different configuration, we 
can form the sum of the integers and the squares of the integers, and filter 
these to select the odd elements: 

(filter odd? 

(add-streams integers 

(map square integers))) 

The corresponding block diagrams are shown in figure 5. 

We can perform other operations on streams. For instance, we can merge 
two ordered streams to produce an ordered stream. If we have two streams, 
we can produce the stream of all possible pairs («,*) where a is in the first 
stream and t is in the second stream. In fact, working with such streams 
(and analogous triples, 4- tuples, etc.) corresponds to using nested loops in 
traditional programming style. 

As an illustration of the expressive power of the stream formulation, 
consider the problem of finding the numbers that can be written as the sum 
of two cubes in two different ways — the so-called Ramanujan numbers. We 
assume that we have a stream operation unique -pairs that, given a stream 
of values a if produces the stream of all pairs («<,«>) with t < j. To generate 
the Ramanujan numbers, we map the along the stream of pairs of integers, 
generating the sum of the cubes of the two components in each pair. We then 
produce the stream of pairs of these sums of cubes and filter this to find the 
pairs with equal components. The surviving pairs contain the Ramanujan 
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Figure 6. The power of block-diagram design it that we can assemble the 
same components in different ways to produce a variety of systems. 

numbers, which we extract by taking the first component of each pair: 

(define raaanujan-numbers 
(nap first 

(filter equal-coaponents 

(unique-pairs (map sua-of -cubes 

(unique-pairs integers)))))) 
(define (equal-components pair) 
(» (first pair) (second pair))) 

(define (sua-of-cubes pair) 

(+ (cube (first pair)) (cube (second pair)))) 

Streams are data objects that encapsulate the control structure for the 
natural iterations on aggregate data. This allows us to write programs that 
manipulate streams without explicitly representing the details of the control.t 
The decoupling of data flow from control allows the implementer the freedom 
to order the computation in any way that is convenient. Infinite streams work 
because we adopt a lazy evaluation strategy that does no computation until 



T APL was the first major programming language to be organised around functional 
manipulations of aggregate data. As Alan Perils says: "(People tell me] 'APL is 
unstructured. There is no while in it.* I say, 'It's got nothing but wiles', which 
pun they don't get." 
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the result of that computation is required J 

Establishing new languages 

As we confront increasingly complex problems, we will find that any fixed 
language is not sufficient for our needs. We must constantly turn to new lan- 
guages in order to express our ideas more effectively. We can often enhance 
our ability to deal with a complex problem by describing it in a different way, 
using new kinds of primitives, means of combination, and means of abstrac- 
tion. Such a linguitic shift occurs in engineering when one needs to describe 
function rather than structure. The structure of an analog circuit is described 
in terms of circuit elements and modules, but analog engineers also use signal- 
flow graphs to capture the essential functioning of signal-processing systems. 

Sometimes it is necessary to completely change the framework by which 
the descriptions are held together. All of the examples we have shown so 
far have been constructed by functional composition. But many systems are 
only awkwardly described in terms of functions. Physcial systems are often 
described as sets of algebraic constraints among several variables. A constraint 
such as x + yz = 3 is no more a function for computing x given y and z than 
it is a function for computing z given x and y. A language for manipulating 
such descriptions must be fundamentally a language about relations, not a 
language about functions. 

The universality of computation allows us to deal with linguistic shifts 
even of this order, because programming languages can themselves be de- 
scribed in computational terms. We can write interpreters to embed one 
language within another, and we can write compilers to translate from one 
language to another. 

In our subject we illustrate these principles by developing interpreters for 
several languages. We can easily implement an interpreter for the procedural 
language we have been using throughout the subject, and by perturbing this 
interpreter we illustrate semantic variants of the language. We also implement 
a vastly different language — a logic-programming language similar to Prolog. 
In this language, the objects of discourse are relations rather than procedures. 
Primitive relations are assertions that are posited. Relations are combined by 
the logical connectives and, or, and not. Relations are abstracted by giving 
rules that allow one to infer new relations from combinations of others. 

We find that interpreters for these different languages have similar struc- 
tures. Each interpreter has an eval phase that classifies expressions, decom- 
posing compound expressions and recursively interpreting the pieces. There 



* Stream processing was invented by Peter Land in [3]. 
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is also an apply phase that combines the results of interpreting the parts by 
specializing the abstractions to specific situations. 

Lisp 

In teaching our subject we use the Scheme dialect of the programming lan- 
guage Lisp. It supports (but does not enforce) more of the large-scale strate- 
gies of design than any other language we know. We can make procedural 
and data abstractions, we can use higher-order functions to capture common 
patterns of usage, we can model local state using assignment and data mu- 
tation, we can link parts of a program with streams and delayed evaluation, 
and we can easily implement embedded languages. All of this is packaged 
in an interactive environment with support for incremental program design, 
construction, testing, and debugging. 

Summary 

The above examples show how the introductory subject in computer science 
can in fact be an introduction to general principles of engineering design. Ro- 
bust designs are organized into levels of abstractions. Each level is described 
in terms of a language that is appropriately constructed for expressing de- 
signs at that level. Descriptions at each level are closed under combination 
at that level, and we can achieve this by providing a library of standard 
parts with conventional interfaces. Sometimes we must invent vastly different 
frameworks to express a design, and there are well-understood technologies 
for erecting new languages. 

By using computation as a forum for expressing methodology this sub- 
ject sets a tone for the rest of the curriculum. Methodology is rarely taught 
explicitly in classes, or written in texts. With the exception of analytic theo- 
ries, the working knowledge of professionals is almost universally considered 
intrinsically informal, hence unreachable except by experience. If we express 
working knowledge formally, in computational terms, we can manipulate it, 
reflect on it, and transmit it more effectively to students. 
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